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CHAPTER 1 



Introduction 



In the introductory Section, one is supposed to set the stage for the coming Chapters. This 
stage is the quantum many-body problem. Its importance hes in the fact that it provides the 
fundamental description of processes in fields as varied as atomic, molecular, solid state and 
nuclear physics. Apart from the thrill of exploring physical phenomena at the quantum level 
there is also the realization that predicting and manipulating such microscopic processes has 
powered much of the 20th century technology, and is likely to lead to further breakthroughs 
in our 21st century. 

In fact, the quantum mechanical description of many interacting particles is a problem 
that has been around since the dawn of quantum mechanics. Already in 1929 Dirac wrote [Ij: 

The underlying physical laws necessary for the mathematical theory of a large 
part of physics and the whole of chemistry are thus completely known, and the 
difficulty is only that the exact application of these laws leads to equations much 
too complicated to be soluble. 

What Dirac meant by this is that, in principle, the formalism to treat many-electron problems 
arising in the study of atoms, molecules and solids is completely known. Assuming that one 
can neglect relativity and that electrons interact solely through the Coulomb interaction, 
quantum mechanics provides the recipe by which the problem can be tackled. This recipe 
consists of solving the Schrodinger equation: 

H^ = E^ . (1.1) 

Mathematically, this is nothing but an ordinary eigenvalue equation in a linear space endowed 
with an inproduct, called Hilbert space in the context of quantum mechanics. The A^-particle 
wave function ^ is a vector in Hilbert space. The linear and Hermitian operator H is called 
the Hamiltonian, and contains all the information about the interparticle interactions. The 
eigenvalue E denotes the possible energies of the system. When identical particles are con- 
sidered, an extra permutation symmetry of the wave function ^ has to be imposed, better 
known as the Pauli-principle for electrons. 

The problem is that the direct application of this recipe is necessarily limited to small 
systems, because of the exponential scaling of Hilbert space with the number of particles 
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involved. This makes the solution of the eigenvalue problem in Eq. (1.1) problematic as it 
scales as 0{n^) where n is the dimension of Hilbert space. A linear spin chain, consisting of 
sites with spin- 2 degrees of freedom, provides a simple example that illustrates this dramatic 
scaling of Hilbert space. A single spin- 2 site is described by a wave function consisting of two 
components, a spin-up and a spin-down term: 

|M/) = Ct|t> + qii). (1.2) 

Only two complex numbers Ca are needed to completely characterize this state. For describing 
two sites one requires: 

1^) = Cnl tt) + c^tl it) + Cu\ U) + Cu\ U) , (1.3) 

so four numbers are needed. It is easy to appreciate that for an L-spin state the general 
wavefunction is: 

1^) = Yl C^i...^lWi,---,^l) , (1-4) 

and one requires 2^ numbers to completely describe the state. This observation led Dirac to 
remark: 

It therefore becomes desirable that approximate practical methods of applying 
quantum mechanics should be developed, which can lead to an explanation of the 
main features of complex atomic systems without too much computation. 

Over the last eighty years a great variety of such approximate methods have been developed 
[11 El m El E]: from perturbation theory and cluster expansions over self-consistent field 
and variational methods to renormalization group methods and stochastical techniques like 
Quantum Monte Carlo. Most of these approximate methods try to somehow capture the 
relevant information, present in the wave function, in a reduced object. 

If the quantum many-body problem is the stage, the main protagonist in this thesis is 
the reduced density matrix. The reduced density matrix method discussed in this thesis is 
an approximate method that tries to replace the wave function, with its exponentially scaling 
number of variables, with the two-particle density matrix (2DM), for which only a quartically 
scaling number of variables are needed. This is a very efficient reduction since these are just 
the degrees of freedom needed for the exact evaluation of the energy. 

The reduced density matrix makes its first appearance in the work of Dirac, in which the 
single-particle density matrix (IDM) is used in the description of Hartree-Fock theory [7]. 
Husimi [8] was the first to note that, for a system of identical particles interacting only in a 
pairwise manner, the energy can be expressed exactly as a function of the 2DM. This becomes 
very clear in second-quantized notation (see e.g. OE]), where a system of identical particles 
interacting pairwise is described by the general Hamiltonian: 

H = ^ia/jaJjC^ "^ 4 X^ Vai^-'ysaWisO-sa^ ■ (1-5) 

a/3 afi-yS 

The expectation value for the energy of any ensemble of A^-particle wave functions l^f') with 
positive weights Wi, can then be expressed as a function of the 2DM alone: 

5:^.(^f l^l^f ) = TV TH('^ = I Y: ^^P-nsH^'L ' (l'^) 

i af3'^/5 



in which we have introduced the 2DM: 

ra/3;-y5 = ^Wi(^f I aj^a^a^a^ I ^f) , with ^^^ = 1, (1.7) 

i i 

and the reduced two-particle Hamiltonian, 

(2) 1 

^a/Si-yS ~ JY _ 1 i^ay'^/SS — ^aStfS^ — SjS^taS + ^/BStaj) + Kt^^^^ . (1.8) 

The idea to use the 2DM as a variable in a variational scheme was first published in literature 
by Lowdin in his groundbreaking article [9j, but even earlier, in 1951, John Coleman tried a 
practical variational calculation on Lithium. To his surprise, the energy he obtained was far 
too low, after which he realized the variation was performed over too large a class of 2DM's 
|10] . Independently and unaware of the work by Lowdin and Coleman, Joseph Mayer |llj 
used the 2DM in a study of the electron gas. In a reply to Mayer's paper, Tredgold [12] 
pointed out the unphysical nature of the results, and suggested that additional conditions on 
the density matrix are needed to improve on them. 

These results led Coleman, in his seminal review paper [13] , to formulate the iV-represen- 
tability problem. This is the problem of finding the necessary and sufficient conditions which 
a reduced density matrix has to fulfil to be derivable from a statistical ensemble of physical 



wave functions, i.e. expressible as in Eq. ( 1.7). In this paper he also derived the necessary and 



sufficient conditions for ensemble A^-representability of the IDM (see also Section 2.2.1), and 
some bounds on the eigenvalues of the 2DM. A big step forward was the derivation of the Q and 
Q matrix positivity conditions by Garrod and Percus [14| . These were practical constraints, 
which allowed for a computational treatment of the problem. The first numerical calculation 
using these conditions on the Beryllium atom |15|ll6j was very encouraging, as the results were 
highly accurate. It turned out, however, that Beryllium, due to its simple electronic stucture, 
is a special case where these conditions perform very well. A subsequent study showed that 
these conditions do not work well at all for nuclei |17t[T8j. This disappointing result, together 
with the computational complexity of the problem, caused activity in the field to diminish 
for the next 25 years. The change came with the development of a new numerical technique, 
called semidefinite programming, which turned out to be very suited for the determination 
of the 2DM under matrix positivity constraints. Maho Nakata et al. flO] were the first to 
use a standard semidefinite programming package to calculate the ground-state energies of 
some atoms and molecules, and obtained quite accurate results. He was quickly followed by 
Mazziotti |20j . These results reinvigorated interest in the method, and sparked of a lot of 



developments. New A^-representability conditions were introduced, e.g. the three-index T 



conditions (see Section 2.2.2.2), set forth by Zhao et al. [21], which led to mHartree accuracy 
[221 [23l [211 [25l [261 IZIj for molecules near equilibrium geometries. 

In recent years interest in the method has been growing, as the variational determination 
of the 2DM results in a lower bound, which is highly complementary to the upper bound 
obtained in variational approaches based on the wave function. In addition, the method is 
essentially non-perturbative in nature, and has a completely different structure unrelated to 
other many-body techniques. A lot of activity has been devoted to the search for new N- 
representability conditions, which improve the result in a computationally cheap way [281I29J . 
There have also been efforts to improve the semidefinite programming algorithms by adapting 
them to the specific problem of density matrix optimization [301 l3Tl [32] , allowing the study 
larger systems. 



4 CHAPTER 1. INTRODUCTION 

In this thesis we discuss the framework of density matrix optimization, its apphcations 
to physical systems, and the contributions we made to the field. In Chapter [2] the A^- 
representability problem is introduced. We start by discussing its formal definition, and 
how useful necessary constraints can be derived. A non-exhaustive overview of what is known 
about A^-representability is provided, and some of the non-standard approaches we developed 
are discussed. The next Chapter is devoted to the semidefinite formulation of density matrix 
optimization. We present a detailed analysis of the three different semidefinite programming 
algorithms that were developed during the PhD. A comparison is made of their performance 
using the one-dimensional Hubbard model as a benchmark. Chapter |4] is rather technical, and 
deals with how the semidefinite programming algorithms can be made more performant by 
exploiting symmetries. In Chapter [5] we present the results that are obtained when applying 
this method to a variety of physical systems, and how new A^-representability constraints are 
derived when the results are not satisfactory. In the application of the method to the one- 
dimensional Hubbard model a drastic failure of the standard two-index A^-representability 
constraints in the strong-correlation limit is observed. An analysis is made of why these 
constraints fail, and what the relevant correlations are that need to be included in order to 
fix the problem. In Chapter [6] an approach is presented which includes these correlations, 
without becoming computionally too expensive. We present results which show that the 
strong-correlation limit is well described using this approach, and that the quality of the 
results is improved for the whole phase diagram. Finally, in Chapter [7] we draw conclu- 
sions about the method, discuss what is needed to make it a competitive electronic structure 
method, and provide ideas for future research. 



CHAPTER 



A^-representability 



In the introduction we introduced the A^-representabihty problem, and how it was discov- 
ered historically. In this Chapter we try to give an overview of what is known about ex- 



act and approximate A^-representability of reduced density matrices. In Section 2.1 the A^- 
representability problem is placed in the broader context of what is known in mathematics 
as marginal problems, and different definitions of A^-representability are introduced. In Sec- 



tion 2.2 we proceed by deriving the necessary and sufficient A^-representability conditions for 
the one-particle density matrix, and some necessary conditions on the two-particle density 
matrix that are well known and frequently used. The last part of this Chapter deals with 
non-standard A^-representability constraints, such as subsystem constraints or diagonal con- 
ditions. For more information on some of the mathematical concepts used in this Chapter we 
refer to Appendix \A\ 



2.1 Definitions of A^-representability 

The A^-representability problem, as introduced by Coleman p^, is actually a special case 
of a set of problems known in mathematics as marginal problems [33]. Given a probability 
distribution with A^ variables, p{xi, . . . ,xn), a A;-marginal distribution is defined as: 



p{xi^,...,Xi^^)= J2 p{xi,...,xn) ■ (2.1) 



The classical marginal problem can be formulated as the question what the conditions are 
that a set of I , 1 /c-marginal distributions has to fulfil to be consistently derivable from an 



A^- variable probability distribution as in Eq. (2.1). The logical quantum extension to this 



problem is to replace the probability distribution with a wave function: 

p{xi, ..., xn) ^^p*{xl,..., XN)tpixi, ..., xn) ■ (2.2) 

5 
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The right-hand side of Eq. (2.2) is just the A^-particle density matrix (A^DM) expressed in 
some basis |xi . . . xn) as introduced by Johann Von Neumann [34J: 

^D = ^Wi|^f)(^f| with weights Wi > and ^Wi = 1 . (2.3) 

i i 

He considers the A^DM to be an object better suited for quantum mechanics, because it 
describes a system as a statistical ensemble of pure states, as opposed to the wave function 
framework which can only handle pure states. This is an advantage, because in reality a 
system is entangled with its environment, and a measurement on the system is described by 
a statistical ensemble of states, and not by a pure state. Because of the description of an 
A^DM as a statistical ensemble of pure-state density matrices, the set of A^DM's is convex (see 



Appendix A|. From its definition in Eq. (2.3) it is clear that an A^DM must be Hermitian, 



positive semidefinite and have unit trace. As with classical probability distributions, we can 
define marginal or reduced p-density matrices (pDM), where (N — p) particles are traced out: 

]\f-L'ai-^...aip;fBi-^^...l3ip = 2_^ -L^Ai...Ai^_iaij Aij+i...;Ai...Aij_i/3i^ Ai^+i... ■ l^-^j 

Ai...Aij_iAi-|^+i... 

The quantum marginal problem can be defined as establishing the conditions tha t a set of 
) pDM's must fulfil to be consistently derivable from an A^DM as in Eq. (|^ 



2.4) [35]. For 

systems of identical particles, fermions or bosons, every pDM derivable by (2^) has to be 
the same, because they can be mapped onto each other by a simple permutation of the 
indices. The A^-representability problem is just the quantum marginal problem limited to 
systems of identical particles. Rewritten in the language of second quantization, a pDM is 
A'"-representable if it can be derived from an ensemble of A^-particle wave functions: 

N^a^...ap;i3i...i3p = ^Wi (^f |aj,^ . . . oj^^a^^ . . . a^J^f ) . (2.5) 

i 



When looking at Eq. (2.5) we see that the pDM is Hermitian and positive semidefinite, the 



trace is tjtt^^ but it is not at all clear which additional conditions a physical pDM has to 
satisfy. In physical terms, the question we are asking is: what are the constraints that are put 
on p-particle properties of a quantum system when it is part of a larger system of A'^ identical 
particles. 

2.1.1 Dual definition of A^-representabifity 

Using the variational principle, and the fact that the set of A^-representable pDM's is convex, 
we can derive an alternative definition of A'^-representability, which turns out to be very useful 
to derive necessary conditions. Let's first define a real symmetric p-particle operator: 

HiP) = Y^ Y: h!^U;,....,A^ ■ ■ ■ <^P. • • • «/^i • (2-6) 

ai...ap/3i.../3p 

The expectation value of this operator in any statistical ensemble of A^-particle states can be 
expressed using only the pDM derived from this state: 

i ai...«p/3i.../3p 



Tr [^ri/(P)] . (2.7) 
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7 



Being an expectation value, the trace on the right of Eq. (2.7) cannot be lower than the lowest 
eigenvalue Eq (^H^^'^. A lower value therefore implies that the ^F used cannot be physical. 
As such the first part of our proof can be stated: if a pDM is A^-representable, then for all 
possible p-particle operators H{,' 



(p). 



TV 






> e!" h^^ 



(2. 



Suppose that we are given a pDM, ^F*, which is not A^-representable. From the separating 
hyperplane theorem for convex sets |36i| it follows that a such a pDM can always be separated 
from the A^-representable set by a hyperplane. A hyperplane C in pDM space (see Appendix 
A| is defined by a number x and a matrix O^^' as: 



%VeC 



if 



Tr 



^F 0(P) 



(2.9) 



The fact that a non A^-representable pDM is always separated from the A^-representable set 
by a hyperplane means that one can always find a p-Hamiltonian for which: 



Tr 



p r* ff (p) 



whereas for all the ^F in the A'^-representable set: 



Tr 



^F //(P) 



< X , 



> X . 



This means that: 



Tr 



^F* H^P^] <x<E^ (h^pA < Tr [^F H^p'> 



(2.10) 



(2.11) 



(2.12) 



This implies that for a non A^-representable pDM, we can always find a p-Hamiltonian for 
which the expectation value of the energy is lower than the ground-state energy of the Hamil- 
tonian. The dual definition of A^-representability can now be stated as: 



Theorem 1. A pDM is N -representable if and only if 

Tr 



P r h'^p) 



>E^(h(p') 



(2.13) 



for all p-particle Hamiltonians He! 



(p) 



In Fig. 2.1 an artist impression of this theorem is shown. The image is trying to convey 



the idea that the convex A'^-representable set is created by an infinite number of hyperplanes, 



which are defined as in Eq. (2.9) by all possible p-Hamiltonians and their ground-state ener- 



gies. 



2.2 Standard A/^-representability conditions 

In the previous Section we discussed the origin of the A^-representability problem, and pro- 
vided two equivalent definitions. In the first, Eq. (2.5), the A^-particle wave function is 



referenced. It would of course be senseless to replace the wave function by the density ma- 
trix in a variational approach, and then reintroduce the wave function in order to impose 
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< (h^'' 



< (h^'' 



Figure 2.1: Schematic representation of Theorem [T] it shows the 7V-representable region 
demarcated by an infinite number of hyperplanes defined by p-Hamiltonians and their ground-state 
energies. 
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Tr ^rF(p) 



P p 



nee 




Tr %TH^P^ 



E. 



Hip) 



nee 



Figure 2.2: The set of pDM's which satisfies the necessary TV-represent ability conditions is larger 
than the exact set, which means that the optimization of a 2DM under these constraints, for a 
certain Hamiltonian H^p\ leads to a lower bound in the energy: E^cc < ^-oxact- 



A'^-representability. The second definition, in Eq. (2.13), is no better in a practical sense, as 



it requires the knowledge of the ground-state energy of all p-particle Hamiltonians, which is 
of course exactly what we are trying to find. 

In this Section we derive some useful necessary A^-representability constraints which are 
used in practical calculations. A necessary constraint is valid for all A^-representable pDM's, 
but it could be valid for non A^-representable pDM's too. E.g. all A^-representable pDM's are 
antisymmetrical in their single-particle indices, but not all p-particle matrices that are anti- 
symmetric in their single-particle indices are A^-representable. It follows that if we optimize 
a pDM over a limited set of necessary constraints the resulting energy is a lower bound to 
the exact energy, because we have varied over a set that is too large, as can be seen in Figure 



2.2.1 A^-representability of the IDM 

The first practical A^-representability conditions were derived for the IDM |T3]. This object 
is used so frequently we introduce a special symbol p for it: 



Po 



1 r 

V-L 



AT-L a/; 



Y,m{^f\aUf^\^f) 



(2.14) 



At this point it is interesting to note that A^-representability is invariant under unitary trans- 
formations of the single-particle basis, as is seen from the definition of the reduced pDM 
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in Eq. (2.4). This means that when we have an A^-representable pDM, we can rotate the 
single-particle basis at will, and stay inside the A'^-representable set. A consequence for the 
IDM is that the necessary and sufficient A^-representability conditions must be expressible as 
a function of its eigenvalues alone! 



From Eq. (2.14) we can derive some obvious necessary conditions on the IDM, i.e.: 

Tr p = ^p„« = iV, (2.15) 



a 



Pap = Pl3a , (2-16) 

P h . (2.17) 

A more systematic way of finding necessary conditions is through the dual definition of A^- 



representability in Eq. (2.13). The idea is to use a class of Hamiltonians for which a lower 
bound to the groundstate energy is known, and use this to obtain necessary constraints on 
the pDM. One such class consists of the manifestly positive Hamiltonians, for which we know 
the ground-state energy is larger than zero: 

Hpos = Y.BlBi, (2.18) 

i 

in which 13' is a p-particle operator. For the IDM there are two independent classes of 
Hamiltonians that can be constructed: 

1. W = "^^PaO-a leads to the condition: 

J]p„(^^|at,a^|^>^ > , (2.19) 

a/3 



which is equivalent to the already stated positivity condition on the IDM (2.17). 
2. B^ = ^^ QaCia leads to the condition: 

^g„(^^|a„ajj|^^)(?^>0. (2.20) 

0/3 

This is a new matrix positivity condition, which can be expressed as a linear map of 
the IDM through the use of anticommutation relations: 

q^O where qip)ai3 = ^ajB - Pap ■ (2.21) 

The interpretation of this condition is that the probability of finding a hole in a par- 
ticular single-particle state has to be positive. Equivalently, the number of particles 
occupying an orbital can't be larger than one. 

These two conditions combined lead to the result that the eigenvalues of a fermionic IDM 
have to lie between and 1, which is just an expression of the Pauli exclusion principle. 

We now have derived necessary conditions for the IDM. Are these sufficient to ensure 
A"-representability? To prove this it is instructive to look at the structure of a IDM derived 
from a Slater determinant. A Slater determinant has N occupied single-particle orbitals, so 
the IDM has N eigenvalues equal to 1, and the rest is zero. An arbitrary IDM that satifisies 
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(|2.17|) and (|2.21|) has eigenvalues between and 1. It has been proved that one can always 

From 



find an ensemble of Slater IDM's which has the same eigenvalues and eigenvectors [T^ 

this fact it follows that the conditions are also sufficient, because every IDM that satisfies 

them can be derived from an ensemble of Slater IDM's. 

Finding the ground-state energy of an arbitrary single-particle Hamiltonian H{, can now 
be reformulated as a variational problem: 



E"" (h(P 



u.c.t. 



minTr 
p 



(2.22) 



(2.23) 



phO, 
q{p) t , 
Tr p = N . 

In the ground-state IDM, the particles are distributed over the A^ lowest eigenstates of the 



r(i) 



Hamiltonian H^ , because of the conditions (2.23). The energy expectation value of two 



particle Hamiltonians cannot be expressed using the IDM alone, one also needs the 2DM. We 
can however construct an uncorrelated 2DM out of a IDM: 



N 



Tal3;-fS = (P A p)al3-^S = Pa-yP^S - PaSP/S-y 



(2.24) 



Using this object in a variational optimization under the constraints (2.23) leads to exactly 
the same result as with the Hartree-Fock approximation [3j. 



2.2.2 Necessary conditions for the 2DM 

In the previous Section we have seen that is relatively easy to derive the necessary and 
sufficient conditions for the IDM. This should give us hope that it won't be that hard to derive 
conditions for the 2DM. Unfortunately it is almost impossible to derive necessary and sufficient 
conditions for the 2DM. In fact, the A'^-representability problem for the 2DM has been proven 
to belong to the Quantum Merlin- Arthur (QMA) complete complexity class [37]. This is not 
surprising, as we have seen in Theorem [T] that the A/^-representability problem is equivalent to 
determining the ground-state energy for all possible two-particle Hamiltonians. Finding the 
ground-state energy for only one two-particle Hamiltonian already scales exponentially with 
system size, which gives an indication of the complexity of the A^-representability problem. 
In this Section we show how people try to get around this and derive some frequently used 
necessary conditions on the 2DM. From now on we drop the indices on F for the 2DM and 
define it as: 

(2.25) 



o/3;75 



AT-L a/3;7(5 



E-^^{^^ 



J fol^a^asa^l^ 



N\ 



Immediately one sees some necessary conditions from Eq. (2.25): 

N{N 



Tr r 



oIZ^"/^ 







a/3 


a/3;7(5 


= 


^ yS\aP ) 


al3;-y5 


= 


—Ti3a;^5 


F 


>- 


. 



a/3; 57 



1) 



fJa-jSy 



(2.26) 

(2.27) 
(2.28) 
(2.29) 



These are, however, not at all sufficient to give a good approximation to the ground-state 
energy. For deriving further necessary conditions we use the same method as for the IDM, 



using positive Hamiltonians with the structure of (2.18) 
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2.2.2.1 Two-index conditions 

Combining two creation and/or annihilation operators yields four different forms of the W 
operator. 

The I condition: the Hamiltonian constructed from 13' = ^aRPap^aO/a has to have an 
expectation value larger than zero: 

Y, Pap{^^\aia\asa^\^^)p-^5 > . (2.30) 

This is just an expression of the fact that T has to be positive semidefinite, which we already 
knew. Physically speaking, it is the expression of the fact that the probability of finding a 
two-particle pair is larger than zero. We refer to this condition as the I condition, for the 
identity condition. 

The Q condition: in exactly the same way B"^ = J2af3 lajiO-aOip leads to: 

Yl qap{^''\aaapala\\^!^)q^s > , (2.31) 

aji-yS 

expressing the positive semidefiniteness of the two- hole matrix, i.e. the probability of finding 
a two-hole pair has to be larger than zero: 

Q^O where Qa/3;75 = X^^^^^^I^^^/^^Ml^f") ' (2-32) 

i 

as first derived by C. Garrod and J. K. Percus in [14j. This matrix can be rewritten as a 
function of the 2DM and the IDM by reordening the creation and annihilation operators 
using anticommutation relations, leading to the following linear matrix map: 

Q(r)a/3;75 = ^a'^^pS " ^p-y^aS + ^ al3;'yS " {^a-yPjiS " ^aSP/S-y " ^PyPaS + ^/SSPa-y) ■ (2.33) 

One can see that the combination of the I and Q conditions already insures that the Pauli 
principle is obeyed, as the p and q condition can be derived from them by tracing one index: 



P»i - ^ 



^5]r,^;7/3, (2.34) 

^^3^j;Q(r)«/3;7/3, (2.35) 



in which M is the dimension of single-particle space. It is seen that p and q are automatically 
positive semidefinite if I and Q are. This, however, does not mean that enforcing the I and 
Q conditions leads us to a better result than Hartree-Fock. The set over which we are varying 
is too large, so we get a lower bound to the ground-state energy. Applying only X and Q 
can lead to considerable overestimation of the correlation energy, defined as the difference 
between the exact and the Hartree-Fock energy. 

Two more constraints can be defined in the same way, using aa' or a' a operators. They 



were first derived in [14j, but in a different form (see Section 2.2.2.3) 
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The Qi condition: the particle-hole operator W = X^a/? daB^aO-p leads to the following 
positivity expression: 

^ gi^{^''\aiapala^\^'')g}^s > , (2.36) 

which is a mathematical translation of the fact that the probability of finding a particle-hole 
pair must be larger than zero. We call this condition the Qi condition: 

^1^0 where (^O.^.^^ = ^u;.(^f la^a/jaja^lv^f ) , (2.37) 

i 

which can again be rewritten as a matrix map of the 2DM using anticommutation relations: 

^1 (r)a/3;^5 = SfSSPay - Ta5;-yl3 ■ (2.38) 

Note that there is no symmetry between the single-particle orbitals here. This condition turns 
out to be much more stringent than the X or Q conditions. The combined conditions IQGi 
are known as the standard two-index conditions. Applying these can already lead to very 
good approximations for some systems, e.g. the Beryllium atom. For other systems, however, 
additional constraints are needed to obtain decent results. 

The G2 condition: the only remaining combination of creation an annihilation operators 
is i^t = Y.ap al^aao'i^, leading to: 

^ gl^{^''\aaalasa\\^'')g^^s > , (2.39) 

i.e. the probability of finding a hole-particle pair has to be positive. We call it the Q2 
condition: 

^2^0 where {g2)^p.^s = J2w,{^f^\aaalasa\\^f^). (2.40) 

i 

Expressed as a function of the 2DM this becomes: 

^2 {^)al3;'r5 = ^a/S^-yS - ^a/SP'yS - S^SPa/S + ^a-yPpS - Ta5;yl3 ■ (2.41) 



As it turns out this is not an independent condition, as is shown in Section 2.2.2.3 



2.2.2.2 Three-index conditions 

As mentioned in the previous paragraph, the two-index conditions sometimes give a very good 
result, but often stronger conditions are needed describe systems correctly. One example of 
stronger conditions are the so-called three-index conditions. They are also derived from pos- 



itive Hamiltonians of the type (2.18), but slightly diff^erent from those used in the previous 
paragraph. For these conditions, use is made of the fact that taking the anticommutator of 
fermionic three-particle operators lowers the rank by one, and the expectation value is there- 
fore expressible as a function of the 2DM alone. The general form of the positive Hamiltonian 
used here is: 

H = B^B + BB^ , (2.42) 

in which the B^^s are combinations of three creation and/or annihilation operators. There 
are three independent conditions that can be derived in this way. These conditions were first 
alluded to in the work of R.M. Erdahl [38j, and explicitly used in PHI22]. 
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The 71 condition If W = X^„a^ ^ag-y^°'^B^\ ^^^ positivity of the Hamiltonian leads to the 
following expression: 

E E ^U, ((*^l44«7«C«ea5|*^) + (^^kca^a^at^ajat 1^^)) 4^ > . (2.43) 

This expresses the positivity of the 7i matrix defined as: 

Ti{T) t (2.44) 

with 

iri)o.grM = E^* [{^f\aiala\aca,as\^f) + {^f\aca,asaUla\\^f)) . (2.45) 

i 

Due to the anticommutation of the B' and B this expression can still be written as a function 
of the 2DM when we anticommute the creation and annihilation operators: 

- iS.y(6i3^ - 5i3(^6^e) PaS + iS-f(Sae - Sa(^6^e) PpS " {^PC^ae - 5aC,^l3e) P-yS 
+ {^-^C^PS - ^PC^-rS) Pat - {Si-yC^aS " '^Q:C'^7'5) P<^P + i^PC^aS " ^aC^ps) P-ye 

- i^pS^^e - Si3eS^s) PaC, + {^-yeliaS " ^ae^^ys) PpC, " {^Pe^aS " ^ae^ips) P-fC, 
+5^(Tap-^Se — ^p(,^ar,Se + ^acj^ PrM ~ ^1(^aP;5C + ^Pe^arM ~ ^at^ PrM 
+5-^5^ aP;<iC — 5p5^a"i;eC, + ^aS^ P"i;eC ■ (2-46) 

Note that this is a matrix on three-particle space, and as such will be computationally much 
heavier to use. Using this condition does not make the X or Q condition redundant, since 71 
implies positiveness of the sum BB'^ + B'^B and not of the individual terms. 



using ^t = Z^afl^t^^ aliala^, producing the inequality: 



The 72 condition The next three-index condition is derived trough the positivity of (2.42) 

■iP-f ''aPy"'0!"'P 



a/37 ^^C 

This expresses the positivity of the sum of the particle-particle-hole and the hole- hole-particle 
matrix 72: 

iT2)aPrM = E ^* [{^f\aiala^ala,as\^f) + (^f laJa^a^aJ^aJ^a^lM^f )) , (2.48) 

i 

which, following the standard recipe, becomes an expression of the 2DM alone: 

T2{T)aPr,Se( = i^aS^Pe — ^ae^ps) P-yC + ^■yC^ap;Se 

— 6aS^'ye;(;P + Sp5^'ye;Ca + ^ae^'y5;(;P — Spe^'yS;(;a ■ (2.49) 

This is a more compact expression than the 71, but it turns out to be a stronger condition, 
much like the Q condition was more stringent than I and Q. 
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The Ts condition The last independent combination of creation/anniliilation operators is 



W = YlaB-y^aB'y^'^^P^'y^ which gives tlie following positivity expression: 



EE*'/37 ((^"^kia^atacata^l^^) + (^^la^ata^a^a^at |M.^)j 4^ > . (2.50) 

a/37 Se(^ 

The above equation expresses the positivity of the sum of the particle-hole-particle and the 
hole-particle- hole matrix T3: 

iT3)aPrM = Y.'^i {{^^\aUpa\a^cilas\^f) + {^f\a^alasaia^a\\^^)) . (2.51) 

i 

Once more, the anticommutation of the creation and annihilation operators enables us to 
write this as a function of the 2DM alone: 

T3{T)al3'y;5eC = ^aS^fB^y^e^ — ^aSTej-pC ~ ^aC^n;Sl3 — ^■y5Tae;f5C + Sf3eTa'y;5( 

— ^■yC^ae;5(5 — ^aS^eCPlS-y — ^aS^^pPeC, + ^aC^-y(5Pe5 + ^ySSeCPa^ 

+ {da5S'y<: - <JaC^75)/0/9e • (2.52) 

In the next Section, however, we show that this condition becomes redundant the Ti condition 
is generalized. 

2.2.2.3 The primed conditions 

Up to now, we have derived necessary constraints using positive Hamiltonians, in which 
the positive Hamiltonians were built with some combination of creation and annihilation 
operators. In some cases it is possible to derive more stringent conditions, called primed 
conditions, involving linear combinations of such operators. The first condition of this type 
to be derived was by Garrod and Percus [Hj , in what was also the first derivation of a ^-like 
condition. 

The Q' condition The idea of primed conditions is to generalize the operator B'. For the 
G' condition it becomes: 

B^ = Y,gai3aiap + C , (2.53) 

a/3 

in which C is a constant operator. The positivity of the Hamiltonian constructed with this 
B' leads to the following expression: 

i a/37<5 a/3 

+C^g:,sP,s + C^>0, (2.54) 

75 

which can be rewritten as: 



Yl 3'al3 (^l)a/3;7<5 9-y5 + {C + ^ 9'al3 Pa/3 1 " ^ Oafi Paf^p-yS 9^6 > ■ (2.55) 

al3-f8 \ a/3 / 0/375 
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It is clear that we get the most stringent condition if we choose: 

C = - 2_] ^c./3 Pap , 

aP 



(2.56) 



since this removes a strictly positive contribution to Eq. (2.55). The remaining terms in 



Eq. (2.55) then express the positive semidefiniteness of the matrix: 

Q' (r)a/3;75 = hsPa-y " '^a5;'^l3 " Pa^P-yS ■ 



(2.57) 



From the definition (2.53) it should be clear that when the G' is fulfilled, both Gi and G2 will be 



fulfilled too, since these are just special cases of G' with a specific value for the number C. The 
Gi condition is G' when C = 0. For the G2 condition it can be seen through anticommutation 
that: 



with 



i?t 


\ ^ 2 t 

= Z^9apaaa'f^ 




a/3 




= ^alp^ap-^glpaiap 




a/3 ap 




= y_^9apO'iap + C 




a/3 


5a/3 = 


-5a/3 and C = ^ ola 



(2.58) 



(2.59) 



Unfortunately G' is not linear in F which hinders its use in a standard semidefinite program. 
However, one can show that the domain in 2DM space for which G' is positive is exactly the 
same as that where Gi is positive, i.e. 

Theorem 2. The nullspaces of the G' and Gi matrix map coincide. 

Proof. In the first part of the proof we show that when, for some F, Gi (F) has a zero eigenvalue, 
then ^'(F) has a zero eigenvalue too. Suppose ^i(F) has a zero eigenvalue with eigenvector 
v. 



iGl{T)v)^l^ = 2J i^pSPaj - ra(5;7/3) V-yS = 2j PayV-^p " 2j ^aS-^P V^s = Q . 
jS 7 7<5 



If we trace Eq. (2.60), we get 



N^payl 



'■ya 



. 



07 



This means that ^'(F) also has a zero eigenvalue with the same eigenvector: 

{G'{T)v)^^ = Y^ Gl{T)ap-y5 V^S - PaP ^ Py6 V^S = . 

7(5 7(5 



(2.60) 



(2.61) 



(2.62) 



In the second part we prove the inverse. Suppose ^'(F) has a zero eigenvalue with eigenvector 
V, then ^i(F) also has a zero eigenvalue, with eigenvector v': 



1/3 = VaP 



Tr pv 

N 



Sap ■ 



(2.63) 
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First we note that v' is also an eigenvector of G' because: 



E^'(r; 



OiP-m 



. 



(2.64) 



Next, we multiply Qi with v': 

■yd 



Y^ {G'iT)al3j5 + PapPys) v\s 

7(5 

Tr pv^ 



/, Pa(5P-y5 

■yS 



V^S 



N 



■J-yS 



. 



(2.65) 



D 



From Theorem [2] we learn that using Q' or Qi during the optimization yields the same 
results, since the edges of the convex sets for which ^i ^ and Q' ^ are the same! The Qi 
condition is to be preferred, because it is linear in F and therefore easier to use in semidefinite 
programs. At the same time one sees that G2 is redundant, because it is included by Q' , 
and following from the above theorem, also by Qi. So from now on we refer to Gi as the G 
condition and never use Q2 again. 

The 72 condition A primed condition can also be derived for the three-index case, through 
a generalization of the B' for the Ti condition, (see [27^ I39j). The positive Hamiltonian has 
the form: 

H = B^B + BB^ , (2.66) 



in which B"^ is the generalized W of the Ti : 






(2.67) 



To it is added the regular second part of the 72 Hamiltonian, to cancel out the three-particle 
terms appearing in the first part. The expression of positivity then leads to: 



Y Wii'i^flHl^f') = Y.Y1 tliSyT2{r)afirM 4c + Yl tlp^^apr,^x, 

i a/37 "^^C afiyv 



uSeC 



111/ 



where 



(2.68) 



(2.69) 



Wa/37;i. = ^'Wi(^f |aj,a|,a^ai.|*f ) = Ta/S-u-y ■ 
i 

The positivity expressed in Eq. ( 2.68| ) can be translated to a matrix positivity condition: 



u 



/i;5eC 



Pfif 



One can see from Eq. (2.70) that this matrix has a slightly larger dimension than the regular 



72 condition, but this is negligible compared to the size of the 72 matrix. It is also obvious 
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that 7^' includes 72, as diagonal blocks of a positive definite matrix are also positive. What 
is more, the Ts can be shown to be a part of the 72 class too, since the B' that creates the 

73 condition is a special case of the general 7^' W : 

a/37 

= Y^ 5/3^tlf^^al - Y^ tlp^ala\a/B (2.71) 

0/37 a/37 

= Y *a/3744'^7 + 5Z ^('"'i ' (^•'^^) 

a/37 A* 

in which 

*a/37 = ~*a7^ ^^^ ^M = ^ */i/3/3 ■ (2.73) 

As reported in [271 EH] , slightly better results are obtained when adding the 7^' condition 
compared to only applying the 71 and T2 conditions. 

2.3 Non-standard A/^-representability conditions 

Up to now we have focussed on constraints which can be expressed as matrix positivity 
constraints of linear maps of the 2DM. These constraints are commonly used in 2DM opti- 
mization. In some situations the two-index conditions suffice to obtain a good approximation 
of the ground-state properties, but often the results are not good enough. In a standard 
approach one would use the three-index conditions to increase accuracy, but this limits the 
size of the systems that can be studied as the computational cost increases dramatically. In 
this Section we introduce some non-standard A^-representability conditions that can be used 
to increase accuracy without making the optimization computationally unfeasible. 

2.3.1 Subsystem constraints 

The first example of non-standard constraints we introduce are the subsystem constraints [29] . 
These constraints were developed for the special case of diatomic dissociation (See section 



5.2), but are more generally applicable. In this Section we derive these constraints without a 
specific application in mind. The idea of subsystem constraints is to impose constraints on the 
2DM restricted to some subspace of single-particle Hilbert space. To derive them however we 
first need to introduce the concept of A^-representability for systems with a fractional number 
of particles. 

2.3.1.1 Fractional A^-representability 

Until now we have only discussed systems with an integer number of particles. When we talk 
about a fractional particle number we don't mean that the particles are actually split up, but 
that the expectation value of the 'number of particles'-operator is fractional. This means that 
the system is open, it can exchange particles with its environment, so the number of particles 
in the system fluctuates. The natural way to deal with these kinds of systems is by taking an 
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ensemble of von Neumann density matrices with different particle numbers: 

^r = ^xjvl^f )(^f I where xjv > , ^x^ = 1 and ^x%N = N . (2.74) 

Ni Ni Ni 

The obvious generalization of Eq. ( |2.74[ ) to reduced density matrices is: 

^r,,...a,;/3,.../3, = ^ ^V(*f K • • • <«/?, • • • «/3i l^f > , (2-75) 

Ni 

in which the x]^^s satisfy the same relations as for the iVDM. It is important to note that it 
is not possible to derive the IDM from the 2DM as with integer A^ density matrices. When 
a fractional- A^ system is described by a Hamiltonian containing both one- and two-body 
interaction terms, one needs both the IDM and the 2DM to express the energy. For these 
systems it is therefore more natural to consider the pair (p, F) as fractional- A^ representable, 
if they can both be derived by an ensemble as in Eq. (2.75) with the same weights x^. 



The set of fractional- A^ representable (/9, F)'s is obviously convex, and the whole argu- 



mentation of Section 2.1.1 is applicable here as well. The dual definition of fractional A^- 
representability can therefore be immediately stated. The pair (p, F) is fractional- A^ repre- 
sentable if, and only if, for all: 

-^ = ^ taiBaiap + T X] VaiS'-ydaialasa^ , (2.76) 

a/3 afi-yS 

the energy expressed in terms of {p, F) is larger than the ground-state energy of this Hamil- 
tonian for an ensemble with average number of particles N: 

Tr tp + Tr FF > E^ (h) . (2.77) 

2.3.1.2 Subsystem 2DM's are fractional-A^ representable 

Let us now take an arbitrary subspace, V, of single-particle Hilbert space. We use Latin 
letters (a,6, ...) for single-particle orbitals that are in the subspace, as opposed to Greek 
letters {a, f3, . . .) for general single-particle orbitals. In this Section we prove that if nT is an 
integer A^-representable 2DM, then the pair {p^,T^), defined as 

^ab;cd = NTab;cd (2.78) 



,V ^ 

N 



Pac 



-—r 2_^ NTal3;cl3 (2.79) 



is fractional- A^ representable on the subspace V, with A^ = Yla Paa ■ ^s atF is integer A^- 
representable, we can write: 

r^6;cd = j;x,(^f |at4a,a,|^f ) . (2.80) 

i 

We can expand each l^f"^) in Slater determinants, classified according to the number of 
subsystem orbitals they contain: 

N 

l*f) = E E %,^Js,)\sr,.,) , (2.81) 

j=0 SjSjv_j 
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in which Sj represents a set of j subsystem orbitals, and sat-j a set of N — j orbitals not 
in the subsystem. Using the fact that the string of subsystem-type creation/annihilation 



operators in Eq. (2.80) does not change the number of subsystem orbitals, that it leaves the 



non-subsystem part of the Slater determinant unchanged, and using orthonormality of the 
SN-j states, we see that 



^Xi(*f|4aJadae|^f) = ^Xi J^ (^i-^_>M«<^«c|*i 



isjv-j' ' 



where 






i JSN- 



Y.% 



iN-jl^V ' 



(2.82) 



(2.83) 



is a state with j particles in the Fock space generated by the subsystem orbitals. These states 
are not normalized, their norm is given by 






1 2 j 



(2.84) 



If we replace them by normalized states 









it follows that 



^^b;cd= Yl ^i'^l-SN-^^isj,_,\'^i4(^dac\^ 



«sjv- 



(2.85) 
(2.86) 



Xi^s^., = 1 ' 



(2.87) 



j;«Siv-j 
where 

j;isjv-j 

because of the normalization of the original A^-particle states. In an analogous way one shows 
that the subsystem IDM p^ can be written as 



P^c 



j;isN-j 



jsjv- 



(2.88) 



This proves that T and p can be derived from the same ensemble of wave functions con- 
taining only orbitals in the subsystem. This ensemble has a fractional number of particles (in 
the subsystem space) given by: 



^ = E 



V 
Paa 



^ jxiw: 



isN-j 



(2.89) 



r,isN-j 



We can again harness the power of the dual formulation of A^-representability (2.77) to 
introduce the subsystem constraints: 

Theorem 3. If n^ is integer N -representable, then for an arbitrary subspace of single-particle 
Hubert space V, the pair {p^ ,T^) as defined in Eqs. (2. IS) and (2. 79) must obey the inequality: 



Trp^t^ + Trr^F^>i?o'^(^^ 



(2.90) 



wi 



th N = Tip^ , and for every Hamiltonian H^ of the form (2. 76), defined in the subspace V. 
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2.3.1.3 Applicability of subsystem constraints 

Having established the explicit form of the subsystem constraints, one could ask the question: 
why would these constraints be important? One answer to that question is that Theorem [3] 
does not hold for approximate A^-representability, using the matrix positivity conditions intro- 
duced in the previous Sections. In other words, the fact that the global 2DM of the A^-particle 
system obeys necessary A^-representability conditions does not imply that the subsystem pairs 



(p^, r^) obey the same fractional A'^-representability conditions. In analogy to Eq. (2.75) one 
could say that the pair {f^p, f^T) is approximate fractional- A^ representable under a set of 
necessary matrix positivity conditions C, if and only if there exists a set of weights xjv in 
which jvP and ^F can be expanded as: 



nP = '^xn NP , (2.91) 

N 

fjT = ^XNNr, (2.92) 



N 

where for every jvF in the expansion holds that 

CiNT) h . (2.93) 

If we have an approximate integer- iV representable F, i.e. for which C{T) ^ for a set of 
>C's. The constraints imposed by this on a subspace 2DM T are not sufficient to ensure 
approximate fractional- A^ representability The subspace C{T)^^s are diagonal blocks of the 
full system £(r)'s, and will therefore be positive semidefinite: 

£(r)^ h . (2.94) 



This is, however, not sufficient to ensure that there exists an expansion of the form (2.92), in 



which, for every ^yF, Eq. (2.93) holds, which is required for it to be approximate fractional- A^ 
representable. This means that 2DM optimization under matrix positivity conditions doesn't 
treat the full system and the subsystem on equal footing. This has the important consequence 



that the method is not size consistent (more on that subject in Section 5.2). Imposing addi 



tional constraints on the subsystem can fix this, as was shown in [29] for diatomic molecules. 



For more general systems there are a plethora of subspaces and Hamiltonians to impose (2.90), 
so it is crucial to choose these carefully. A stochastical method in which these are optimized 
was proposed in |40j . Once the subspace and Hamiltonian are chosen one can calculate the 



right hand side of Eq. (2.90) using exact diagonalization if the subsystem is small enough, or 



a density matrix optimization if it is larger. 



2.3.2 The sharp conditions 



The sharp conditions are another type of condition that makes use of the dual definition of A^- 
representability. Here, once again the positive Hamiltonians WB are used, but in a completely 
different way: now sharper bounds are imposed on the upper and lower eigenvalues of these 
Hamiltonians [28j . 
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2.3.2.1 Sharp bounds on X(r) 

Canonical transformation We first show that a two-fermion creation operator: 

B^ = Y1 ^"/3«a4 ' (2.95) 

a/3 

with B a skew-symmetric matrix, can always be written as, 

^^ = ^E^"«"«^' (2.96) 



V2 



a 



through a canonical transformation of the single-particle basis. Each single-particle state a 
has its paired state a. For the sake of completeness, a short proof follows. 
The IDM of the two-particle state constructed by W has the form: 

p = 2B*B^ . (2.97) 

If we transform to the basis of natural orbitals, p is diagonal and real: 

p = p* which means that BB* = B*B . (2.98) 

In this basis the commutator of p and B is: 

[p,B] = 2[B*B'^,B] = -2[B*B,B] (2.99) 

= -2{B*BB - BB*B) (2.100) 

= -2[B*,B]B = 0. (2.101) 

It follows that the matrix B is block diagonal in the natural basis, with (2 x 2)-blocks since 
the eigenvalues of p are at least doubly degenerate ^13j : 

[p, BU = Ba^iK - A/3) = . (2.102) 

Let us now define a new blockdiagonal matrix D as: 

D(°) = J^B^'^^ , (2.103) 

V Aq 

in which i?(") is the S-block corresponding to the eigenvalue Aq,. D is skew-symmetric, 
but also unitary, which means that if we diagonalize it the eigenvalues are purely imaginary 
phases, i and —i, and the two eigenvectors are complex conjugate, X and X* . If one more 
transformation is performed: 

(2.104) 

(2.105) 

every block in D becomes: 

(2.106) 



x_ = 


■ v^i'-^-^"'- 


x+ = 


= ;,(x + x-), 


i 0\ 

-i, 


( 1 
^ -1 
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The same transformation turns B into: 



5(«) 



which means we can write W as (2.96) with 



Xf] 




and 



Xn 



(2.107) 



(2.108) 



Pairing Hamiltonian It turns out that the Hamiltonian B'B, with B' a pairing operator 



(2.96), is exactly solvable through a Bethe-ansatz approach [3Tll32lll3]. The eigenstates can 
be classified according to the presence of unpaired single-particle states. As the Hamiltonian 
doesn't connect states with a different number of unpaired states, we can restrict the discussion 
to the fully-paired eigenstates; in other subspaces the unpaired states are blocked, and simply 
removed from the available single-particle space. The eigenstates are labeled by n — 1 numbers 
y, in which n is the number of pairs present in the eigenstate: 



l*M) = 0^(O)</'^(yi)...0^(yn-i)|O) 



and a new pair-creation operator cp' is defined as: 



yx\ 



2^a'^a ■ 



(2.109) 



(2.110) 



One can show (see e.g. in [3]) that the action of the pairing Hamiltonian on the states (2.109) 
is: 



n-l 



n— 1 



B^mg})=Ei{y})\^f^y) + Y.V,{{y})B^B^[ H <^^(%) 1 |0> , (2.111) 



i=l 



in which 



2 / 1 "^1 1 



j(7^i)-- 



1 Vi - Vj 



Ei{y}) = E-1-^T.i 

k=i y^ 



(2.112) 



(2.113) 



From Eq. (2.111) it is clear that (2.109) will only be an eigenstate if the {y} satisfy the 
equations: 

Vi {{y}) = , Vi . (2.114) 



The energy of the eigenstate is then given by (2.113) . 



In general a system of non-linear equations can be hard to solve, but in this particular 
case, Newton's method can be used to find the largest eigenvalue. The different solutions are 
separated by singularities [44'. 45j , so we have to find an initial point in the right compartment 
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of y space. For the highest eigenvalue, a good guess is that all y's will be negative and close 
to zero. The following initial point: 

2/°=-^ for a=l,...,n-l, (2.115) 

was always observed to converge to the highest eigenvalue. 

It is now straightforward to introduce the new constraint, using the dual definition of 
A^-representability for the pairing Hamiltonian: 

- Tr FB^B > E^{-B^B) = -AJJ^'^^ [B] . (2.116) 

Note that the eigenvalue spectrum of the Hamiltonian B'B only depends on the singular 



values X as introduced in Eq. (2.96). We can rephrase this condition as follows: for every 
two-particle state, its occupation in the 2DM has to be smaller than the maximally allowed 
occupation in an A^-particle system calculated through the Richardson equation: 

Tr TB^B < Aj;^^^ [x(S)] . (2.117) 

It is interesting to see that for a structureless pairing operator: 

^^ = V 2M ^'^"'^"'^^ ' ^^*^ ^" ^ ~'^" ^ ^"^ ' (2.118) 

a 

which has the highest maximal eigenvalue of all possible B'^s, the maximal eigenvalue is 

Ar^ [x(X)] = n (l - ^^^) . (2.119) 

This is the sharpest upper bound that can be put on the eigenvalues F without knowledge of 
the eigenvectors, which was derived by F. Sasaki [36]. 



Finding the most stringent condition The constraint put forward in Eq. (2.117) is not 



practical yet, because we don't know how to choose W , and we can't impose the inequ ality 



for all B'^s. In this Section we propose a method to find the B' that violates Eq. (2.117) the 
most. Given an arbitrary 2DM F, we introduce a cost function F: 

HB) = ^^ [XT'" [xiB)] - Tr TB^b] , (2.120) 

which we minimize as a function of the matrix B. To optimize this function we need the 
gradient of F with respect to the pair amplitudes B^,^. The most difficult term is the first 
one as we need an analytical formula which tells how the maximal eigenvalue varies with B. 
Using the chain rule we find: 



dy§-^ [xjB)] ^ ^ /5Af-(x)\ (dx^ 
B,u ^A dx. )\dB^, 



K>0 



E^^Gi-. (--) 



where k > means we only sum over independent x^s, not over the x^- The relationship 
between B and x is a unitary transformation: 

Baf3 = V2'^UKaXKUfii3 or inversely ^"^ = ^ X^ UnaBa/sUap . (2.122) 
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This means that the second term in Eq. (2.121) is: 



dB 



/.tu 



1 

V2 



\y H^^ KU '-' HV'^ KfJbi 



(2.123) 



The first term can be derived from Eq. (|2.113) by differentiation: 



dXa 



n-1 



{x) = ^x^ + A^^^J^[x] 



7iyk\ 



dx. 



(2.124) 



which is written in function of the y's. It is left to determine how the y's change as a function 
of the structure coefficients x. The infinitesimally changed y's have to remain a solution to 
the Richardson equations, which leads to the expression: 



dVi 

dx,. 



ivi^)) 



4x, 



'^-Vi^lY 



+ 



M 



2T- 



1 



n-1 



n-1 



1 (1 - Vi^lf 
1 dyj 



+ E 



1 



Jif^i 



]li ivi - VjY 



4 y - 

iVi — ViY dx 



0, 



dyi 
dx,. 



(2.125) 



which yields n linear equations, that can easily be solved to obtain the derivatives. In 



summary, the gradient of the Eq. (2.120) can be written as: 



dF 
dBZ 



1 



Tr B^B 
2B 



1 Q\niax 



^^0 ^^- 



7(5 



7<5 



liU 



(Tr B'fBY 



[a?J^^ [x{B)] - Tr TB^B 



(2.126) 



After a 2DM optimization, the most violated condition can now be found using a non-linear 
conjugate gradient algorithm [37], and can be added as a constraint in a subsequent opti- 
mization. This can be done iteratively, until there are no violated constraints left. 



2.3.2.2 Sharp bounds on Q{T) 

Having established a sharp bound on X(r), we can ask ourselves if the same thing can be 
done for the other constraint matrices. If an upper bound can be found for the eigenvalues 
of Q, the following constraint can be formulated: 



TV Q{V)BB^ < E^,^{BB^) . 



(2.127) 



It turns out that the eigenvalues oi H = BB' can also be found using a Richardson-like 
approach discussed in the previous paragraph. Suppose we have found an eigenvector of BB^ 
with eigenvalue A: 



-,N\ 



BB^\^^) = X\^'') . (2.128) 

If we let another W operator act on the left and right side of this equation we have: 



B^B\^ 



N+2\ 



X\^ 



N+2\ 



with 



1^ 



N+2\ 



B^\^ 



N\ 



(2.129) 
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which means that every non-zero eigenvalue of BW on the A^-particle system is also an 
eigenvalue of B'B on the N + 2-particle system. In an analogous way one can show that 
the inverse is also true, which means that the maximal eigenvalue of BB^ on an iV-particle 



system can be found by solving the Richardson equations (2.114) for A^ + 2 particles ! Given 



an arbitrary 2DM T, the most violating two-hole operator B can be found by optimizing the 
cost function: 



Fq{B) 



1 



Tr B'lB 






{x{B))-Tr Q{T)BB^ 



(2.130) 



for which the gradient can be calculated in the same way as for (2.126) 



Using the approach introduced in the previous two paragraphs we constructed additional 
constraints to increase the accuracy of the standard two-index conditions. In practice, how- 



ever, we found that for molecular and atomic systems, the sharp-X (2.117) and sharp-Q (2.127) 



conditions are never violated if the Q condition is active. So in general these constraints did 
not deliver the increased accuracy hoped for. As would be expected, the constraints did help 
for pairing type Hamiltonians: 



H 






^^XaXisa\a\apap 



(2.131) 



aP 



when g is large enough. They insure that for g ^- oo the correct limit is found. 



2.3.2.3 Sharp bounds on ^(r) 

For the Q condition, it is not possible to transform the Hamiltonian to a type solvable by the 
Richardson equations. For a general ^-type 13' operator: 



a/3 



(2.132) 



the problem seems to be very hard, and no solution has thus far been found. For a restricted 
class of B' operators, when i? is a Hermitian matrix, it is possible to derive some constraints 
of the form: 



Tr 



^(r)^^^] < E^^^iB^B) , and Tr [^(r)^^^] > E^^^{B^ B) . (2.133) 



Compared to the solution for sharp-X and sharp-Q, this derivation is relatively easy. Consider 
a Hermitian operator W = B, which in its canonical basis looks like: 



B' = E 



CrvOi^Cln 



(2.134) 



Its eigenvectors on A^-particle space are Slater determinants with as eigenvalues the sum of the 
energies of the A^ occupied orbitals. Because 13' is Hermitian, B'I3 is just B^, which has the 
same eigenvectors as B\ but with the eigenvalues squared. Now that we have diagonalized 
the Hamiltonian, we just have to find out for which set of N single-particle energies S, the 
eigenvalue: 



Es 




(2.135) 



is extremal. 
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Maximal eigenvalue For the maximal eigenvalue there are only two possibilities, either 
the N highest or the N lowest single-particle energies sum up to the highest absolute value. 



Minimal eigenvalue Because the different terms in the sum (2.135) can cancel each other, 
the lowest eigenvalue is much more difficult to obtain. Basically any combination of single- 
particle energies can have the lowest absolute value, which means it is a combinatorial op- 
timization problem. This constraint is probably very important, as the Q map seems most 
closely related to the structure of physical Hamiltonians studied in physics and chemistry. One 
example of where this constraint would surely improve results is in the case of a dispersion 
Hamiltonian: 

D{B\X) = (b^ -Xi) , (2.136) 

where it has been shown that the standard matrix positivity conditions fail drastically j48j . 

2.3.3 Diagonal constraints 

The diagonal conditions are a hierarchic set of linear inequalities that only involve the diagonal 
part of the 2DM. The first conditions of this type were derived by Weinhold and Wilson [39], 
and later generalized and expanded by Davidson [50ll51j. Davidson derived these constraints 
using the dual definition of A^-representability with the following Hamiltonian: 

H = A + '^ Baolaa + ^ Cai^aia'i^ai^aa ■ (2.137) 

The expression of the energy expectation value of this type of Hamiltonian as a function of 
the 2DM only involves diagonal elements of the 2DM. The eigenstates of this Hamiltonian 
are Slater determinants, with energy eigenvalue: 



A 2DM will satisfy the A^-representability condition (2.13) for every Hamiltonian of the type 



(2.137) if it satisfies (2.13) for the extreme points of the convex set of Hamiltonians (2.137). 
These extreme points generate the diagonal inequalities, and in [501 EI] a complicated general 
algorithm is used to derive some of these. In [22] however, a much simpler way of deriving 



these conditions is introduced, which involves the positivy of polynomials of the type: 

(*^| Y, ya^^...nanph^ ■ ■ ■ 1^"^) > , (2.139) 

a/37... 

in which ha is the number operator aada- The so-called (2, 2)-conditions are given by: 

{hahis) > , {{l-ha)hi3) > 0, 

{ha{l-h^)) > and ((l-n„)(l-%)) > 0, 

which are just the diagonal elements of the two-index conditions. In the same way one 
can derive (3, 3)-conditions on the 3DM, in which the three-particle term sometimes has 
a positive and sometimes a negative prefactor. Adding these constraints together one can 
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construct new constraints only containing diagonal elements of the 2DM, which are called the 
(3, 2)-conditions, e.g. the sum: 

{hahph-y) + ((1 — nQ,)(l — n/3)(l — n^)) = (1 — n^ — n^ — n^ + nan/3 + hah^ +nph^) , (2.140) 

which is the diagonal 71 . It is possible to continue the construction of higher-order {r,r)- 
conditions in this way. By taking the appropriate combination of these (r, r)-conditions one 
derives higher-order constraints that can be expressed as a function of the diagonal terms of 
the 2DM alone, called (r, 2)-conditions. 

What is nice about these constraints is that they are computationally much cheaper 
to impose than the full matrix-positivity conditions. Imposing all fourth-order constraints 
would scale in the same way as imposing the full ^-matrix condition. A disadvantage of this 
technique is that the results are dependent on the choice of the single-particle basis used, i.e. 
we lose unitary invariance of our approximate A^-representability. One can envision two ways 
to try to cure this, which have not been examined up to now (but should be in the future). 
The first way is by imposing that the optimized energy is stationary to infinitesimal unitary 
transformations. A unitary transformation U is defined by its anti-hermitian generator e: 

U = e^^ . (2.141) 

An infinitesimal unitary transformation can therefore be expressed asU = 1 + Ae. The change 
in the energy expression under influence of this infinitesimal transformation is, to first order 
in A: 



af^"fS \ a' 

13' 7' S' 

= Tr ri^ + ^ [ j; ey, Yl iHT)^8-ri'S " E ^-' E (^r)„^^„,^ 
y 77' <5 aa' P J 

a/3 

in which a bar over a matrix indicates that we trace over one pair of indices: 

fa7 = 2_^'^ciP-nP ■ (2.143) 



From Eq. (2.142) it is clear that the energy will be stable to infinitesimal unitary transfor- 



mations if we impose the constraint: 



[r,/7]=0. (2.144) 



A second way to improve the result obtained by diagonal conditions is by optimizing 
the single-particle basis. Suppose we have performed an optimization using only two- index 
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conditions, and in the next iteration we want to impose diagonal three- index conditions of 
the Ti type: 

^2 (^)al3'y = PjJ + ^al3;aP " '^I3r,l3'y ~ ^ar,a'y > . (2.145) 

How to decide what single-particle basis to use in which to impose these? We could search 
for the most violating single-particle basis by minimizing the following functional: 

Fal3-fiU) = 2_^^l'lPl'-l'^l'l ^Z_^^a'aU*P'l3^a>l3';a'l3>Ua>aUl3il3 (2.146) 

7' a'/3' 

with respect to the unitary matrix U. 



CHAPTER 3 



Semidefinite programming 



The variational determination of the 2DM of a quantum system, henceforth called the v2DM 
technique, can be formulated mathematically as a semidefinite program (SDP). A semidefinite 
program is not a program, but a type of constrained optimization, in which a cost function is 
optimized under the constraint that a matrix remains positive semidefinite. There is a vast 
literature on this subject, and a nice duality theory has been established. In this Chapter 
we first give a short introduction to some important results from the literature, and show 
how to formulate density matrix optimization as an SDP. After this a number of standard 
algorithms, which have been tailored to the specific form of v2DM, are explained in some 
detail. 



3.1 Hermit ian adjoint maps 

For the following it is useful to introduce the Hermitian adjoints of matrix maps introduced 



in Section 2.2 The Hermitian adjoint maps are defined through: 

Tr Ci{T)A = TV 4(^)r , (3.1) 

in which j4 is a matrix of the same dimension as the image of the map £j in question (e.g. a 

three-particle matrix for a 7i map, etc.), and the trace is a sum over the appropriate indices. 

It is important to note that from now on slightly different matrix maps are used com- 



pared to the ones introduced in Section 2.2 It is mathematically and computationally more 
pleasing to deal with maps that are homogeneous in F, so we add a term j^/j!^_-^\ to the 
non-homogeneous terms. 

The X and Q maps are Hermitian, so they are identical to their Hermitian adjoints. For 



the other maps however this is not the case. Using Eq. (3.1) the Hermitian adjoint of the Q 
map can be shown to have the form: 

S (^)a/3;7<5 = isj _ -i [^I35^a-y - ^aS^fS-y - ^jS^AaS + 5a'yAi3s] (3.2) 
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in which a particle-hole matrix A is mapped on two-particle matrix space and 



(3.3) 



The 7i operator maps a two-particle matrix on a three-particle matrix, so its Hermitian 



adjoint has to map a three-particle matrix A on two-particle matrix space. Solving Eq. (3.1) 
with £ = 71 one finds that: 



V i^)a^;^S = j;^7J^—^{^ajSi35-dasSpy)TTA + Aap.^s 



(3.4) 



1 



with 



2(iV-l) 

"4 

-^07 



SpsAa^y — SasAp^y — 6j3^Aa5 + Sa-yAps 

= /_^Aa/3X;ySX , 
A 

— / ^ -'^aXu-j'yXK • 
Are 



(3.5) 
(3.6) 



In the same way one can derive for £ = Ti that 



^2 (^)a/3;75 — ^/^ _ . n '^/3<5^a7 " ^aS^jB^ — (5/3^^a5 + ^ayAps + ^a/3;75 (3.7) 

~ ^<5a;/37 ~ ^(5/3;a7 ~ ^7q:;/3(5 + ^7/3;a(5 



where A is a matrix on two-particle-one-hole space and 

i 

^a/3;7<5 



IQ,^ 



^a/3;7(5 



/ ^ ^AKa;Are7 ) 
Are 

/ ^^a/3A;7<5A , 
A 

2^^Aa/3;A7<5 • 
A 



(3.8) 

(3.9) 

(3.10) 



The Hermitian adjoint of T2 is slightly more involved. We construct the adjoint by demanding 
that: 

'T2 (r) u\ (At a 



Tr 7^'(r)y4 = Tr 



a;T p) \Al A 



it 



-'t 



Tr7^'T(A)r, 



(3.11) 



is fulfilled. This leads to the following expression for the 7^ map: 



+W^ {^^' ^^"^-^^ " ^""^ ^"^p^-yP ~ ^^^ ^^^^^- + '^"^ ^^'^^'^/^) • ^^-^^^ 
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3.2 Primal and dual semidefinite programs 

The general form of a semidefinite program [53j, in its primal formulation, is given by: 

max-TrXu° u.c.t. X^O and Tt Xu' = h\ (3.13) 

in which a matrix X is varied to optimize a linear function (— Tr Xu^) under the con- 
straint that it remains positive semidefinite {X ^ 0), and under a set of equality constraints 
(Tr Xu^ = h^). This problem is completely defined by the set of matrices u" = {u^,u^}, 
and a vector h. As with many constrained optimization problems, one can define a dual 
optimization problem using the Lagrangian of the original problem [5^ : 



mm 

7 



^7i/i* u.c.t. Z = u° + ^jiu'hO ■ (3.14) 



In this case a matrix function of the dual variable 7 has to remain positive definite. An 
important feature of this duality is that the primal objective function always bounds the dual 
from below (and vice- versa), when both primal and dual variables satisfy all equality and 
inequality constraints. This can be inferred from: 

J2 lih' + Tr Xn° = ^ 7iTr Xu' + TrXu° = Tr XZ > , (3.15) 

i i 

because both Z and X are positive semidefinite. When the primal and dual problems are 
getting closer to their optimal values, they will move towards each other. We call the diff'erence 
between the primal and dual optimal value the primal-dual gap: 

r] = TT XZ . (3.16) 

It has been shown that (when both primal and dual feasible regions^ aren't empty) the primal- 
dual gap vanishes at the optimal value of X and 7 [55J. Because X and Z are also positive 
semidefinite at their optimum, it follows that the much stronger complementary slackness 
condition holds when both X and Z are optimal: 

XZ = 0. (3.17) 

3.3 Formulation of v2DM as a semidefinite program 

In v2DM we want to optimize the energy by varying a matrix, the 2DM, under the constraints 
that it has the right particle number, and that some linear matrix maps of the 2DM are 
positive semidefinite, i.e. 

EsYiv{Hy) = minTr Wh^^'i] (3.18) 

r TV r- ^(^-1) 

u.c.t. { ^ ~ 2 ' (3.19) 

1 £, (r) ^ . ^ ^ 



Here the Cj are a collection of constraints as defined in Section 2.2 This is obviously a very 
similar problem to the SDP introduced in the previous Section. The explicit connection is 
worked out in this Section for both the primal and dual formulation. 

^The feasible region consists of the set of variables for which all the equality and inequality constraints are 
satisfied. 
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3.3.1 Primal formulation 

In a primal SDP one varies over a matrix X, which has to be positive semidefinite. It is 
therefore reasonable to have X as a blockmatrix, with the different constraint matrices as 
blocks: 

^ = 0^^. ^0- (3-20) 

j 

Prom now on we drop the index j on the constraints Cj when talking about a general constraint 
different from I. The matrix X contains free-ranging variables, so we need to put linear 
constraints on these variables to make sure that, for all matrix inequalities C: 

Xc = C (Xx) . (3.21) 

This can be done using the linear constraints of the primal SDP. If we introduce a complete 
and orthonormal basis of the constraint-matrix spaces, {g]r}, we can reformulate the linear 
equalities that have to be fulfilled as (where the trace is on i2-space): 

Tr Xcgl = Tr £ (Xx) gi , (3.22) 

or, using the Hermitian adjoint maps defined in the previous Section: 

TrXcgi: = TTXxC^g]:) . (3.23) 



The equalities (3.21) can now be imposed in a standard primal SDP fashion: 

Tr Xi/c = > (3-24) 

in which n^ is a blockmatrix with two non-zero blocks, the I and the C block: 

{v'c)x = -^Hac) , (3.25) 

{u]:)c = 9'c- (3-26) 

As an example, when the active constraints are IQQ, the Ug matrices which impose the linear 
constraints on the Xg matrix are: 

-SHah) o\ 

0. (3.27) 

gy 

The particle number constraint can easily be written in the standard form: 



T, x„- = ^'^^- " , (3.28) 

where only the I block of u^'^ is non-zero, and equal to the unit matrix on two-particle space: 

[{u '')x]a/3;7<5 = la,3;7<5 = Sa^SjSS - ^aS^p^ . (3.29) 

For the energy to be optimized we only have to set the X block of u^ equal to the Hamiltonian 
and the rest equal to zero. Now we have everything to express v2DM as a primal SDP, and 
use standard SDP algorithms to solve it. This primal formulation was used by M. Nakata 
|19j in his pioneering article on variational density matrix optimization. 
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3.3.2 Dual formulation 

To express v2DM as a dual SDP it is useful to introduce a complete, orthonormal basis of 
traceless two-particle matrix space {/'}, which satisfies the following relationships: 

Tv f = , (3.30) 

Tr ff = 5'^ , (3.31) 

/a/3;7(5 = ~//3a;7<5 = ~fs-y;al3 = fs-y;/3a ■ (3.32) 

In this basis every 2DM can be decomposed as: 

If we define: 

7i = Tr Tf and h' = Tr H^'^^ f , (3.34) 

the energy expression can be written as: 



..^ 2 Tr r) (Tr H^^A ^^ 

MM -I) ^ Z^ " 



(3.35) 



When the particle number is fixed, one can see that the only freedom one has to optimize the 
energy is in the traceless part of the matrix. This means that optimizing over the matrix T 
under the particle number constraint is equivalent to: 



mm 

7 



^lih" . (3.36) 



Since the matrix maps C are linear and homogeneous in F, one can write: 
Defining: 

^° = mImIi) ® ^^ (^) ^"^ "^ = ® ^^ (^^) ' (^-3^) 

J 3 

it is clear that the constraint 

Z = u° + ^ 7iU^ ^ , (3.39) 

i 

is equivalent to constraining all the matrix maps to be positive definite. One can see that it 
is far more natural to write the v2DM problem as a dual than as a primal SDP. There are 
considerably less variables to be optimized, and the constraints are much easier to apply. In 
the rest of the thesis we therefore use the dual formulation. 

3.4 Interior point methods 

The first class of algorithms to discuss are interior point methods [:55j. These methods try 
to optimize the cost function while staying inside of the feasible region. Interior point meth- 
ods are generally very stable and have nice convergence properties, but are computationally 
demanding because at every iteration a large linear system has to be solved. An important 
concept for interior point methods is the central path. 
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Tr rH(2) = c 



-'-^min 




max 



Figure 3.1: The central path is defined by the minima of the potential barrier cj), in all 
Hamiltonian planes with allowed energies. One can see that both the maximal and minimal energy 
lie on the central path. 

3.4.1 The central path 

To define the central path we introduce a barrier function, which has the property of self- 
concordancaj [56], defined on the feasible region: 



^il) 



lndetZ(7) . 



(3.40) 



This function is rather flat on the inside of the feasible region, becomes larger and larger when 
coming closer to the boundary and is +oo on the edges of the feasible region. The point that 
minimizes this potential is called the analytical center of the feasible region. For the following 
set of optimization problems: 



min0(7) 

7 



u.c.t. 






7^h' 



(3.41) 



the optima, for all allowed energies £"„ 



< e < E„ 



define the central path, as shown in 



Figure 3.1 Both the maximal and minimal energy obtainable lie on the central path. This 
is important because interior point methods try to follow the central path in their search for 



^An R — ^ R function f{x) is self-concordant, if \f"\x)\ < 2/"(a;)5. A multi dimensional R" -> R function 
g is self-concordant if the restriction of g to any line segment is self-concordant. This property is primarily 
important in the convergence analysis of various algorithms. 
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the optimum, since the barrier function becomes singular on the edge of the feasible region, 
which should be avoided as long a possible. 

There are some nice properties associated with the central path of the primal and the dual 



problem. To see this, consider the optimality conditions for (3.41 ), which is that the gradient 
of Lagrangian of the problem has to vanish, i.e.: 

Tr Z-'^u' = Xh} , (3.42) 



with A some Lagrange multiplier. The fact that Eq. (3.42) shows a remarkable similarity 
to the primal feasibility condition is no coincidence. The matrix Z~^ /X is not only primal 
feasible, but can be shown to be the solution to the following optimization problem: 

min - Indet X u.c.t. Tr Xu^ = hj- and - Tr XvP = rj - ^ , (3.43) 

X A 

with n the dimension of the matrix X. This means there is a pairing between points on the 
central path of the primal and the dual problem. For every point on the dual central path 
Zc, there is one on the primal central path Xc, and up to a scale factor they are each others 
inverse: 

X,Z, = "^1. (3.44) 

n 

When we have reached to optimum, the primal-dual gap vanishes and we retain the complen- 
tary slackness condition: 

X*Z* = . (3.45) 



3.4.2 Dual-only potential reduction method 

This is the most transparent interior point algorithm, and it only considers the dual problem. 
It is a conceptually simple algorithm that is very flexible and easy to adapt to the specific 
structure of the problem at hand. The idea is to use a potential barrier function to impose 
the positive semidefiniteness of Z. If we minimize the following potential: 

'A(7) = Yl ^'^' - * 1^ det Z(7) , (3.46) 



for a certain value of t, the optimal value lies where the two competing terms in Eq. (3.46) 
are in balance. The energy term tries to move 7 in the negative direction of h, but as it gets 
closer to the edge of the feasible region, the potential gets steeper. The parameter t decides 
where this balance occurs, and the smaller it becomes, the closer to the edge the optimum 
is. As a function of t, the optimum 7*(i) lies on the central path. For large t, 7*(t) lies 
around the analytic center, for i — )• 0, 7*(t) lies on the edge, which is exactly the solution to 



the dual SDP. The idea of the potential reduction method is to solve Eq. (3.46) for a large 
enough value of t, and use the solution as a starting point for a subsequent optimization with 
a smaller value of t. This is repeated until t is small enough, and we have converged to a 
point close to the edge. 
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3.4.2.1 Solution for fixed penalty 



To solve the optimization problem Eq. (3.46) we use the Newton-Raphson method. This 



method approximates the non-linear potential by its second-order Taylor expansion: 

*(7„ + h) » *(7o) + E*^. (g) +i*^- (a^*^ . (3-«) 

and then searches for the step 6^ that minimizes this expansion. The optimality conditions 



for Eq. (3.47) lead to the following system of linear equations to determine 6^: 



(3.48) 



When sufficiently close to the optimum, the Newton method is known to converge quadrati- 



cally. The only problem left is the solution of the linear equations (3.48). 



The gradient: the gradient of Eq. (3.46) reads: 

V(/.^ = h' - tTv Z-^u' 
and can be rewritten as: 



(3.49) 



V<A^ = /i^-tJ^Tr [£,(r)-i£,(r)] 
j 



(3.50) 



This means we can express the gradient as a matrix, without any reference to the basis {/*}: 



Vc^ = PTr(H('^-tY.c][cArr']\ , 



(3.51) 



where Pxr is the projection on traceless 2DM-space: 

- ,,, , 2Tr^ 



(3.52) 



The Hessian: the Hessian is the matrix formed by the second derivatives of Eq. (3.46): 

W^=tTr [Z-WZ~\^]=tY,T^r [Ck{T)-'CkinCk{r)-'Cj,{n] . (3.53) 

k 

The action of the Hessian on any traceless matrix A = ^- dip can be expressed as: 



Y^W^S, = tJ^Tr 



Ck{v)-^ [Y^^kinsA cu{T)-^Ck{r 



t^Tr 4 (£fe(r)-i£fc(A)£fc(r)-i) / 



(3.54) 
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It follows that the action of the Hessian on a traceless matrix A can be written as another 
matrix: 



nA = tp-T, 



^4(A(r)-i£fc(A)£fc(r)-i) 



(3.55) 



It is important to notice that Eq. (3.55) constitutes a very efficient matrix- vector product 



for the linear system Eq. (3.48). If the dimension of single-particle space is M, a Hessian 



matrix- vector product would scale normally as M^. The special structure of the Hessian for 



physical problems implies that the matrix-vector product in Eq. (3.55) scales as only M^ . In 



addition, the Hessian is positive definite and symmetric, and this makes the linear conjugate 
gradient method an attractive way to solve the linear system iteratively |47]. In this way we 
do not need to construct and store the Hessian, but just use it through its action on matrices. 
We also have no need to choose an explicit basis {/*} because everything can be expressed in 
terms of matrices. 

The line search: once we have found the direction A that minimizes the quadratic ap- 



proximation of the potential (3.46), we can speed up the convergence of the Newton method 



by minimizing (3.46) as a function of the steplength a taken in this direction, i.e. find the a 
for which: 

V^(t>{a) = ^<P{-f + a6-f) = . (3.56) 



da 



This function can be calculated analytically: 

Va0(a) = Tr Ai7(2) - t ^ Tr \Cj (E + aA)"^ Cj{A) 



(3.57) 



and evaluated for any a (given a 2DM E and a direction A). Every evaluation of this 
function involves inverting a matrix, which means this is quite slow. There is a way to 



simplify Eq. (3.57) by solving the generalized eigenvalue problem: 

C{A)w = X^C{T)w . 



(3.58) 



This can be transformed to a normal symmetric eigenvalue problem with real eigenvalues and 
orthogonal eigenvectors: 

(c{r)-^C{A)C{r)-^]v = X^v with v = C{r)^w. (3.59) 

The completeness of the eigenvectors v of the above problem, implies that: 



£(E) = ciT)h r£v^vrj ^r)K 

£(A) = £(E)Uj;Af^;,,z;n£(E)i 



(3.60) 
(3.61) 



Using Eqs. (3.60) and (3.61) it follows that: 



£(E + aA) = £(E); 



y^(l + aXf)viV, 



C(TY 



(3.62) 
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and we can express the matrixproduct term in Eq. (3.57) as: 



C{T + aAy'C{A) = C{T) 



c{Ty 



^ Vl + c 



aAf 



^c„. „,r 



^U 



Af 



ViV, 



Yl ^^^j^i 



£(r)i 



£(r) 



(3.63) 



in which the orthonorniahty of the eigenvectors has been used. The trace of Eq. (3.63) can 
therefore be expressed purely in terms of the eigenvalues A : 



Tr 



Af 



£(r + aA)-£(A)J^ErTtAf 



(3.64) 



It is now clear that Eq. (3.57) can be expressed in terms of the eigenvalues of Eq. (3.58): 



V„0(a) = TrAi/(2)_t^K^ 



xf' 



1 + aXf' 



(3.65) 



Once the generalized eigenvalue problem Eq. (3.58) is solved, one can evaluate (3.65) as a 



scalar function for any value of a, and the bisection method to solve Eq. (3.56) becomes very 
efficient. 



Optimal value for fixed t: the optimal value of the optimization problem for fixed t is 
located where the gradient is zero: 



V0* = h' -tTr [Z-\'] = . 
This point lies on the central path, and we see that the matrix: 



X = tz 



-1 



(3.66) 



(3.67) 



is primal feasible. Based on Eq. (3.42), we can estimate that the primal-dual gap, when we 



have optimized for a certain value of t, is given by: 



rj = Ti ZX = nt 



(3.68) 



and use this number as a convergence criterion in the program. 



3.4.2.2 Outline of the algorithm 

To summarize the different aspects of the method exposed in the previous paragraphs a 
schematic outline of the algorithm is given below: 
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Algorithm 1 The dual-only potential reduction algorithm 
Choose e > ; e^ew > 0; < /? < 1 



M{M-1)- 
while nt > € do 



do 



while 6ncw > enew 

Solve nA = -V(/) for A 



Diagonalize Ci{Ty 


-^A 


(A)£,(r)-^ 


Solve Tt AF(2) - 


ty^, 


(^. 








\ 


1+aA^ ^ 


r ^ r + aA 








5 = a\\A\\ 








end while 








t^ Pt 








end ^vhile 









i£,; 



t> Newton-Raphson loop 
Linear Conjugate gradient method 



for a 



3.4.2.3 Adding linear inequality constraints 

Suppose we want to add a number of additional constraints of the form: 

Tr TCi > Ci for i = 1 , . . . , m . 



(3.69) 



How can we fit these constraints into the formalism derived in the previous Sections? For 
the dual-only potential reduction algorithm it turns out to be quite straightforward. We can 



rewrite Eq. (3.69) 



L{T)ii = TV rC° = Tr 



T[a 



2c,: 



N{N -I) 



>0, 



(3.70) 



as a diagonal matrix of dimension m that has to be positive semidefinite. The standard 
formulation of the dual SDP can still be used, and we just have to change the matrices 
{u*^,u*} in Eq. (3.38) to include the new constraints: 



^0^ iV(JV-i) 
M(M-l) 



©^.(1) 



®m 



©^.(r) 



©Mr 



(3.71) 



The changes in the algorithm pertain to the calculation of the gradient, the Hessian map and 
the line search function. 



The gradient: the gradient is still of the form Eq. ( |3.49[ ), but is now extended as: 

V<^^ = /i^ - tJ^Tr [£,(r)-i£,(f )] - t^L(r)T/L(r),, . (3.72) 



The last term can be written as 



fil 



L,jin = Tr [C^f 



(3.73) 
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so the matrix representation of the gradient reads: 



v^ = PTr U'^ -tY^c] [c, ivr] -tY^mjic^] 



(3.74) 



The Hessian: when hnear constraints are added, the Hessian matrix gets the form: 

W^ = i ^ Tr [Ck{r)-'Ck{f)Ck{r)-'Ck{n] + t Y, L{T)l^L{f)kkL{P)kk • (3.75) 
k k 

The action of the Hessian on a traceless matrix is now: 

Y,n''s,=tY,TT [£,(r)-iA(A)A(r)-i£,(r)] +t J] (r^) ^^^kk , (3.76) 



] k 

which can be written as a matrix: 



n^ = tPTr 



Y^ci (A(r)-A,(A)A,(r)-) +5: (1^) c: 



(3.77) 



The line search: the terms that are added to the line search function (3.56) are already 
scalar, and if we define: 

L{T)ii ' 



A 



(3.78) 



the added terms are of exactly the same form as the original ones, and the new line search 
function becomes: 



V„<A(a) = TrAi^(2)-t^^ 



xf' 



1 + aXf' 



*Er 



Af 



+ aXf 



(3.79) 



3.4.2.4 Adding linear equality constraints 

Adding linear equality constraints of the form: 



Tr r^, 



n — cn ) 



(3.80) 



can be necessary when we want to restrict the optimization to a certain spin expectation 
value, or if we want to impose some symmetry on the 2DM. It is relatively easy to include 
this in the algorithm: we just have change the u^ and n* matrices slightly, and define a new 
orthogonal basis {/*} which is not only traceless, but also orthogonal to all the {-E„}'s. For 
this we first introduce a new orthogonal set of matrices {En}, which is a basis for the space 
spanned by the {E'^} = {1, En}: 



En = J2^S-'2 )nmE'„, with Snm = Tr E'nE, 



m 



(3.81) 



Applying the same transformation to the corresponding scalars {e^} = { — ^ — ,en}'- 



E(^-) 



nm^ni ' 



(3.82) 
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we can expand every 2DM satisfying the equality constraints (3.80) as: 



(3.83) 



in which the set of matrices {/*} form an orthonormal basis that spans the orthogonal com- 
plement of {En}. This can be represented as a dual SDP, in which the u* matrices are still 
defined as in Eq. (3.38 ), but with the new definition for the /*'s. The u^ matrix can be written 
as: 

^° = ©A(/°), (3.84) 



where / is defined as: 



f = Y,enEn 



(3.85) 



The algorithm itself remains basically unaltered. The only things that change are the gradient 
and the Hessian map. As {/*} is now not only t racel ess bu t also orthogonal to the E^s, we 
have to replace the projection Piv in Eqs. (3.51) and (3.55) with a new projection operator: 



P/(^)=^-^Tr 



AE„ 



En 



(3.86) 



3.4.3 Primal-dual predictor-corrector method 

The next algorithm solves both the dual and the primal problem at the same time [32]- It is 
a much more complex algorithm than the dual-only method, and conceptually not so simple 
to describe. At first sight, it would seem superfluous to solve both the primal and the dual 
version of the problem, as the solution to the dual alone gives us everything we need. But as 
we will show, knowledge of the primal problem can speed up convergence for the dual, and 
vice- versa. By exploiting the specific structure of the linear matrix maps, we are able to solve 
the primal and the dual at almost the same cost as solving the dual alone. The method we use 
is a path-following method, which means we won't optimize some potential, but try to follow 
the central path which leads us to the optimal value. Starting from a feasible primal-dual 
starting point {X, Z), we try to find a new primal-dual point: 



iZ + Az)iX + Ax) 



u—l 
n 



with 



< 1/ < 1 , 



(3.87) 



that has a reduced primal-dual gap and is located on the central path. The problem with 
Eq. (3.87) is that the left-hand side is not a symmetric matrix. There are several ways 



to symmetrize this equation, which lead to systems of linear equations that determine the 
primal-dual direction (Ax, A^). We follow the approach introduced by J. F. Sturm [57] , 



3.4.3.1 Equations of motion 

To derive the equations of motion for (Ax, A^) we first consider how a semidefinite program 
transforms under general linear transformations L. If the primal matrix X transforms as 
(where L"^ = (L'^)^): 

X = L^XL , (3.88) 
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obviously the conditions 

XhO and X tO , (3.89) 

are equivalent. If the objective function and the linear constraints are to remain the same, 
i.e. 

Tr Xu^ = Tr Xn° , and Tr Xu' = Tr Xu' = h' , (3.90) 

the matrices {u°} = {vP,u''} have to transform as: 

u" = L-^u^L-^ . (3.91) 

This means that the dual matrix transforms in the same way: 

Z = u° + ^ 7in* = L-^ZL-^ . (3.92) 

i 

As a result, the primal-dual gap remains invariant, 

Tr XZ = Tr L^XLL-^ZL-^ = Tr XZ , (3.93) 

and the central path is mapped on itself: 

XZ = L^XLL-^ZL~'^ = -1 . (3.94) 

n 

Now consider the transformation L^ that maps X and Z to the same matrix V: 

L'^XLd = L-/ZL-/ = V . (3.95) 



For an L^ satisfying (3.95) the following also holds: 

LdLlXLdL^^ = Z , (3.96) 

from which it follows that: 

(x^LdL^X^^ =X^ZX^ . (3.97) 

All of this implies that L^ satisfies the following equation: 

D{X, Z) := X-^ (x^ZX^) ^ X--^ = L^L^ , (3.98) 

and Ld can be constructed by taking a Cholesky decomposition of D{X, Z). Even more, when 
we right-multiply L^ with an arbitrary orthogonal transformation the product still satisfies 



(3.98). It follows from Eq. (3.95) that this freedom can be exploited to transform V into a 



diagonal matrix. For a feasible primal-dual point {X, Z), we can use an L^ of the form (3.98) 
to transform it into {X, Z) = (V, V). We can now ask what step {Dx, Dz) in this transformed 
space we have to take in order to satisfy: 



(V + Dx)(V + Dz) = —1 , u.c.t. TtDxu' = and Dz = y^ dfu' . (3.99) 



Ignoring the quadratic term, this reduces to: 

V'^ + V{Dx+Dz) = —l, or Dx + Dz = —V-^ -V , (3.100) 

n n 
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which expresses the primal-dual step in the transformed space as a function of V. If we trans- 
form thisback to the original space, we obtain two equivalent equations. By left-multiplying 
Eq. (3.100) with L^ and right-multiplying with L^ one obtains what we call the dual 
equation: 



Ax + D{X,Z)-^AzD{X,Z) 



-1 _ "V ry-l 



z- 



X 



n 



Multiplying to the left with L^ and to the right with LT yields the primal equation: 



^^-1 



Az + D{X, Z)AxD{X, Z) = ^X"^ - Z 

n 

In both equations the primal and dual step have to satisfy the equalities: 

Tr Axu* = and A^ = ^ S-yiu^ , 

i 

from which it follows that they are orthogonal to each other: 

Tr AxAz = . 



(3.101) 



(3.102) 



(3.103) 



(3.104) 



These equations can also been derived in a primal-dual potential-reduction approach, as shown 
by Nesterov and Todd in |58| . 

3.4.3.2 The overlap matrix 



The non-orthogonal matrices {u°} = {n'^, n*} as defined in Eq. (3.38 ), are linearly independent 
and they span a subspace of the total space the matrices X and Z inhabit. If we want to 
project a general matrix on the subspace spanned by the m°'s, we need an expression for the 
overlap matrix, which is defined as: 



5„« = Tr u"u^ 



Using the Hermitian adjoints of the linear maps C we can rewrite Eq. (|3.105|) as: 

s^p = Y,^T f4(A(r))/^ 



(3.105) 



(3.106) 



in which {/"} is the orthogonal basis of traceless 2DM space {/*}, expanded with the nor- 
malized unity matrix f^. From Eq.(3.106) it follows that the overlap matrix can be seen as 



a linear map from two-particle space on itself, whose action on a two-particle matrix T is: 

5(r) = ^4(£fc(r)) . (3.107) 

k 

It turns out that this map can be written as a generalized Q map, which is defined as: 

Q(a,6,c)(r)„^^^5 



— C (^o^F/j^ — SjS-yTaS — SaS^i3.y + SjBsTa-y) . 



(3.108) 



This is like the Q map introduced in ( |2.33[ ) but with general coefficients {a,b,c). The proof 
is somewhat tedious and proceeds by considering every Ck separately. 
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(1) C\Ck = X^: it is trivial to see that 2^^(r) = T and that this is a generaUzed Q map with 
coefficients 

a = l 6 = c = 0. (3.109) 

(2) C\Ck = Q^: to re-express Q^ we first calculate the various pieces, 



Q(r)„^ 
l(r) 



Od'yi- 



M-N -I 
N{N -I) 
{M - N){M - N - 1) 



N{N-1) 



M -N - 1 
N- 1 



■ a-f ) 



(3.110) 
(3.111) 



Substitution into Eq. (2.33) leads once again to a generalized Q map with coefficients: 

2N- M 



, 4Af2 + 2N - 4NM + M^ - M 

" = ' ^= N^N-iy ' 



(N-l)' 



(3.112) 



(3) Cl.Ck = G^G- using the same strategy one finds on the basis of Eq. (2.38) 






wy ■ 



Substituting this into (3.2) leads to another generalized Q map with coefficients: 

2N -M -2 
a = 4: 6 = c 



{N-iy 



(3.113) 



(3.114) 



(4) Cl.Ck = 7^ 71: the terms needed by (3.4) are found by tracing (2.46): 

"^1 i^)al3;jS 



(M - 4)r,^.^5 - 

'M- N-2 



M -N -2 



{^a^y^pS - ^ai^p-y)'^ 



Ti (r)„^ 
Ti(r) 



N -1 

{M - N - 2){M - N - 1 



N{N - 1) 

(M - 3)(Af - 2iV) 



Oa^fi 



N{N - 1) 
(M - 2)(M(M - 1) - 3N{M - N)) 
N{N-l) 



N-1 



(3.115) 
(3.116) 

r„-,(3.ii7) 

(3.118) 



and substitution into Eq. (3.4) leads to the coefficients: 



a = M -A , 

M3 - 6M^N - 3M2 + 12MiV2 _^ i2MN + 2M - ISN'^ - &N'^ 

3iV2(^r_i)2 

M2 + 2iV2 _ 4^^ _M + SN -A 
^ ~ 2(A^-1)2 • 
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(5) Cf^Ck = 72/2: for the contribution of the T2 condition to the overlap matrix we need 
the terms derived from Eq. (2.49): 



M-N- - 

Ti {^)al3-n& = N -I ^^'^'^"^ ^ ^pS^ay -{M - 2)VaS-yl3 

'M{M -N)-{N - 1)(M - 2) 



7-2 (r) 



07 



N -I 



i 07 + 007-1- ' 



which, when substituted into Eq. (3.7) give the following coefficients: 

2 27V2_^(M-2)(4A^-3) -M2 



a = 5M - 8 



iV-1 



2(iV-l)2 



(3.119) 
(3.120) 
(3.121) 

(3.122) 
(3.123) 



(6) C\Ck = T2 Ti- finally, for the 7^' condition, the contribution to the overlap matrix is 



almost the same as for the regular Ti- The extra terms in (3.12) just add some simple terms 
to a and c: 



a = 5M - 4 



2A^2 + (M - 2)(4iV - 3) - M2 - 2 



(3.124) 



then applying partial trace operations on Eq. (3.108) leads to: 



Q 

a + M{M - l)b-2{M -l)c ' 



■ 07 



Q 



b{M-l) 



a-y 



a + M{M - l)b - 2{M - l)c 



a - c{M - 2) 

Upon substitution into Eq. ( 3.108| ) and solving for T one obtains, 

T = Q-\a,b,c)iQ) = Qia',b',c')iQ) , 



Oct-ylc^ 



N-1 2(iV-l)2 

The overlap-matrix map is just the sum of the various terms obtained, and hence itself a 
generalized Q map, with rather complex coefficients. 

Inverse map of a generalized Q map The inverse of a generalized Q map can be shown 
to be another generalized Q map. Consider for brevity the notation: 

Q{a,b,c){T) = Q , (3.125) 

(3.126) 
(3.127) 

(3.128) 

(3.129) 
(3.130) 

^ a[c{M-2)-a] ' ^^'^^^^ 

These are important relations since they allow to evaluate the action of the inverse overlap 
matrix on a two-particle matrix as fast as a Q map. i.e. at a computational cost which is 
negligible compared to the other matrix manipulations. 



where 



1 

a 



b' 



ba + bcM - 2c^ 



a [c{M -2)-a][a + bM{M - 1) - 2c(M - 1)] 
c 
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3.4.3.3 Solution to the equations of motion 



In this Section we show how to solve Eqs. (3.101 ) and (3.102). This is done by decoupling the 



primal from the dual variables, using the fact that the primal and dual step are orthogonal 



to each other (3.104). We start by deriving the dual step Az from the dual equation. 



Solution to the dual equation If we project the dual equation (3.101) on the space 

(3.132) 



spanned by the non-orthogonal basis {u*} (which we will call ZY-space) we obtain: 

(Tr D-^uW-^u') 6-fj = Tr Bu' , 



j 



K 



in which the right-hand side of Eq. (3.101) is denoted by 5, and the linear equalities for the 



primal and dual step (3.103) have been used. The form of the equation can be seen to be 



identical to the Newton equations obtained in the dual-only program (3.48), which means it 



can be solved in exactly the same way, using the linear conjugate gradient method, without 
explicit construction of the dual Hessian matrix l-L or any reference to the non-orthogonal 
basis set {u*}. The right-hand side of Eq. (3.132 ) can be transformed to a two-particle matrix: 



J^Tr [Bv^] r = Y,Y.^^ [BkCin]f = PT, 



E4(^fc) 



(3.133) 



where B^ are the different blocks of B corresponding to the various matrix maps. In 



Eq. (3.133) a matrix B, block diagonal in the different constraint spaces, is mapped on a 



traceless two-particle matrix, this is called the collapse map from now on. The left-hand 



side of Eq. (3.132) has exactly the same structure as the Hessian in the dual only program 
(3.53). As a result, one can express the action of the dual Hessian matrix Ti on an arbitrary 



traceless two-particle matrix A as: 



n^A 



PTr 



E4(VA.(A)Z),i) 



(3.134) 



in which the D^ are again the blocks of the D matrix corresponding to the different constraints 



£fc. Once the solution, Agoi, to this system of equations (3.132) is found we can construct the 
dual step as: 

Az = 0A(A,oi) . (3.135) 



It is interesting to see that the right-hand side of the dual equation (3.132) is exactly the 



same as the negative gradient in the dual-only potential reduction program: 



Tr 



u 



^z-i 



X 



n 



n 



Tr [Z 



-W] 



h' 



(3.136) 



with t = —. The Hessian however, is only identical when D = Z, which is the case when 
X ~ Z~^, i.e. when {X, Z) is on the central path. When we deviate from the central path, 
this Hessian will use information of the primal SDP to find a better direction toward the 
central path and the optimum. 
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Solution to the primal equation The solution to the primal equation (3.102) is obtained 



in a similar manner, by projecting this equation on C-space spanned by matrices {c*}, which 
is the orthogonal complement of U-space. With B again denoting the right-hand side of 
Eq. (3.102) and making use of Eq. (|3.103) one gets: 



"^{Ti D c' D d) 6xj = Tr Bc\ 



K 



where we have used 



A 



X 



^(5xi 



(3.137) 



(3.138) 



This is again a symmetrical positive-definite system of linear equations that can be solved 
iteratively using the linear conjugate gradient method, without explicit construction of the 
Hessian matrix T-L^ ^ or any reference to the basisset {c*}. Indeed, l-L^ can be seen as a map 
from C-space on itself, since for an arbitrary matrix in C-space: 



E 



CiC 



(3.139) 



the image of e under the primal Hessian map is 



n^e = Pc [DeD] 



(3.140) 



in which Pq is the projection on C-space. This projection can be executed quickly by using 
the inverse of the overlap matrix of the Z//-space basis vectors. Suppose we have an arbitrary 
block matrix A of the same dimension as X and Z. First we project it on the space spanned 
by the basis {n'^,n*} = {^i"}. The projected matrix A' reads as: 



A' = ^Tr [Au-]{S-') 



al3 



(3.141) 



aP 



where the overlap matrix S appears because of the non-orthogonality of the basis. As shown 
in Section [3. 4.3. 2| the inverse overlap matrix can also be considered as a map from two-particle 
space on itself. The projected matrix A' can now be written in block-matrix form as: 



^ -^^J 



'5-'fE4(^^)) 



(3.142) 



To project A on Z//-space we still have to remove the component along the u'^-matrix: 



Pu{A) = A' 



Tr u^A' 
TV u^u^ 



u 



(3.143) 



Since C-space is the orthogonal complement of the W-space, the desired projection of A on 
the C-space is simply given by 

Pc{A) = A-Pu{A) . (3.144) 

Note that the right-hand side of the primal equation is the negative gradient of the potential: 

(pp{X) = Tt Xu° -tlndetX , (3.145) 



with t = — , and the Hessian is the same when {X, Z) is on the central path. 
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3.4.3.4 Outline of the algorithm 

In this Section a short outline of the algorithm is presented. The first step is to initialize 
the primal-dual variables, after which they are directed towards the central path. Then the 
actual minimization of the primal-dual gap takes place, which is done in a predictor-corrector 
loop. 

Initialization We need a feasible primal-dua l star ting point. An initial feasible dual point 
Z^^' , i.e. a matrix that satisfies the inequality (3.39), is easily found by setting 



zw 



u 



(3.146) 

which corresponds to setting al the 7j's equal to zero. To construct a feasible primal starting 
point we take a completely random matrix X and project it on a matrix X' which satisfies 
the primal equality constraint: 

Tr X'u' = K- . (3.147) 

This is achieved using the inverse overlap matrix of the {m°} basis, 

X' = X - ^ (Tr Xu° - /i") S-^t 



lU 



(3.148) 



X-L 



The last term on the right-hand side can be computed as: 



X' 



\j)^j 



'5-ME4(^^)-^ 



(2) 



(3.149) 



At this point, X' satifies the equality constraint (3.147), and one just has to add u^, with a 



positive scaling factor that is large enough to ensure positive semidefiniteness: 

X(°) =X' + au° ^0 . (3.150) 



Centering run Before the actual program can be started, a couple of centering steps have 
to be taken, which is done by solving the equations (3.101) and (3.102) with u = 1. The 



purpose is to go sufficiently near the central path, without bothering about the primal-dual 



gap. In a first step Eq. (3.132 ), which has the smallest dimension, is solved using the conjugate 



gradient method, and the dual solution Az is obtained. The primal solution Ax then follows 



from the dual equation (3.101) by substitution. For these initial centering steps, both linear 



systems are so well conditioned that hardly any iterations are needed for convergence. As a 
measure for the distance from the center we use the potential |53] : 

<^{X,Z) = -liidetX -IndetZ , (3.151) 

which is obviously minimal (for points with the same primal-dual gap ry = Tr XZ) on the 
central path. The potential then has the value: 

V 



n 



$(X^Z^) = -nln 

When the potential difference (which is always positive): 
*(X,Z) = $(X, Z) - $(X^ Z'^) 

= nlnTr XZ — nlnn — IndetX 
is sufficiently small, the centering run is stopped. 



In det Z 



(3.152) 



(3.153) 
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Predictor-corrector run In this part of the program the primal-dual gap is minimized 
by alternating predictor and corrector steps. A predictor step tries to reduce the primal-dual 
gap by solving the equations (3.101 ) and (3.102) with u = 0. This is done in exactly the same 



way as for the centering run, by first solving (3.132) for Az, then substituting into (3.101) 
to obtain an approximate primal step A^. The final primal step Ax is obtained by solving 



(3.137) using the conjugate gradient method with the approximate Ax as a starting point. 



Note that when the primal-dual gap decreases, the condition number of the primal and dual 
Hessian matrices increases and more iterations are needed before convergence is reached. One 
can adjust the convergence criteria of the primal and dual conjugate gradient loops, in order 
to minimize the combined number of iterations. 

At this point we have a predictor direction (Ax, A^). The logarithmic potential 



e{a) = ^{X + aAx, Z + aAz) , 



(3.154) 



in the predictor direction (see Eq. (3.153)) can be simply evaluated (analogously to the line 
search function in the dual-only program) for any value of a by solving two eigenvalue equa- 
tions: 



X ^AxX 2 ] vx 



X"" vx 



and 



Z -iAy.Z 2 I vz 



\^ vz . 



Using these eigenvalues one can write: 

In det {X + aAx) = In det 

= lndetX-F^ln(l + aA, 



X^ (l + aX~^2AxX-^\ X\ 



(3.155) 



(3.156) 



and thus simply rewrite Eq. (3.154) as: 
e{a) 



'f(X,Z) + ln[l + a(cx + cz)] 

- ^ ln(l + a\f) - ^ ln(l + aA| 



(3.157) 



1. 



1, 



(3.158) 



where 

cz = -Tr XAz and cx = -Tr ZAx 
T] rj 

With a standard bisection method one can now compute the stepsize a corresponding to the 
maximal deviation from the central path we want to allow. 

After the predictor step, a corrector step is taken, which is equivalent to the centering 
step described previously. The alternation of predictor and corrector steps continues until the 
primal-dual gap is smaller than the desired value. 



3.4.3.5 Adding linear inequality constraints 



Adding linear inequality constraints of the form (3.70) to the primal-dual algorithm is much 



more involved than for the dual-only algorithm. As described in Section 3.4.2.3[ the only 
change in the primal and dual SDP is that an extra term is added to the matrices {tt"}, as 



in Eq. (3.71). This means, however, that the overlap matrix of the set {«"} is modified, and 



52 



CHAPTER 3. SEMIDEFINITE PROGRAMMING 



the formulas derived in Section 3.4.3.2 cannot be used. The extra terms in the overlap matrix 
are simply: 

S!:p = Y,L{nnL{fh, (3.159) 

i 

and as a result the action of the overlap matrix on a general two-particle matrix can be 
written as an extended generalized Q-like map: 

QL{a, b, c) (r)„^.^5 = aTai3;^s + b {Sa-ySiss - SasS/s-y) r (3.160) 



We now show that the inverse map can still be constructed, at the cost of solving a system 
of linear equations with dimension 2m + 1 (with m the number of inequality constraints). As 
before we invert the equation: 

QL{a,b,c){T) = Q , (3.161) 

by calculating the partial traces: 



Q 



ay 



[a - c{M - 2)] r,^ + 6ajT [b{M - 1) - c] + - J^ (vcn C^ , (3.162) 



Q = [bM{M-l)-2c{M-l) + a]T + -S2(rCf)Cf 



(3.163) 



We cannot solve this system, because we have three equations but m + 3 unknowns, so we 
need more equations. We can derive m extra equations by: 



QC? = E 



\{cfC0)+a6.j 



rc^j -4TCf + 2Cfr, 



(3.164) 



but we get m new unknowns: 



^Cf = Y.T^Jcf 



a-y 



ya 



If we add m more equations of the form: 



1 



QCf = [a- c{M - 2)] rCf + Cf [b{M - 1) - c] T + - ^ (fC^) Cf ^ , (3.166) 



(3.165) 



we finally have a closed system of 2m + 1 equations and 2m + 1 unknowns: 

^\ /a icf o\/r\ 



QCO 



2Cl \C^,C]_+a5,, -4 



Qcy \ncf icfq 6.,.J 



TCI 



where 



(3.167) 



A = bM{M - 1) - 2c(M - 1) - a , 
K = a- c{M - 2) , 
H = 6(M-l)-c. 



(3.168) 
(3.169) 
(3.170) 
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ai Q + ^4 QCO + Y^alQCf = T , 



This system can be solved numerically to obtain the relations: 
a(Q) 



f3iQ + Y.Pi' QC^ + ^Pi' QCf = rq , 



li 



Q + E^i'QC^ + E^i'Qcf = rcl 



The calculated numbers can be substituted: 

r«7 = -Qa, - -SaMQ) -]-Y^ /3'{Q)Cf , 

i 

and finally yield the inverse of the extended generalized Q map: 

1 



(3.171) 
(3.172) 
(3.173) 

(3.174) 



QL^{a,b,c){Q)ap.-y6 



Qal3;'r5 {b-\ I {SajSps - SaS^py) a{Q) 

d CI \ K/ / 



+- 



c 



Aku 






~E(^'(Q)cf = T. 



a/3;7(5 ■ 



(3.175) 



This inverse overlap-matrix map is needed if we want to project on W-space, defined by the 
{ii*}'s with added terms for linear inequalities. The form of the projection is exactly the 



same as before (3.141 ), but with the old overlap matrix replaced by the new one (3.175 ). The 



solution to the primal system now changes, in that the projection operator on C space (3.144), 



appearing in the definition of the primal Hessian, uses the changed projection operator on 
U-space. 

For the solution to the dual equation, the changes are exactly the same as those men- 



tioned in Section 3.4.2.3 There are extra terms added to the collapse map due to the linear 
inequalities: 



J^TV [W]r = PTr 



E4(^^o + E^'^: 



(3.176) 



in which the Bj" are numbers on the diagonal corresponding to the i'th linear constraint. The 



modifications to the dual Hessian map are exactly the same as those in Eq. (3.77): 



iD 



n'^A = PTr 



E4(^."^A(A)z),-i) + E 



L k 



VoP 






'kk 



(3.177) 



3.4.3.6 Adding linear equality constraints 



Adding linear equality constraints to the primal-dual algorithm is quite straightforward. For 
the solution of the dual problem the changes are exactly the same as with the dual-only 
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program, i.e. the Pxr appearing in Eqs. (3.133) and (3.134) is replaced by a Pf as defined in 
Eq. (3.86). For the solution to the primal algorithm the only modificiation is the projection 
on W-space, more specifically the removal of the component along uP (3.143). For this we 



have to introduce several matrices: 



and orthogonalize them: 



u„ 



i;^ L.i[Efi) , 



i°n 



*V1 \ / 



u„ 



with 



(Su) 



Tr Ku'i] 



The new projection is then defined as: 



(3.178) 



(3.179) 



(3.180) 



3.5 Boundary point method 



The interior point methods discussed in the previous Section 3.4 are very stable and always 
converge to the desired accuracy. A disadvantage, however, is that at every point a Newton- 
like system of equations has to be solved. In the algorithms presented earlier we have avoided 
solving this system explicitly by using iterative methods and a fast matrix- vector product. 
The number of iterations needed to solve these equations, however, explodes when we get 
close to the optimum, because the system becomes ill-conditioned. This occurs because, on 
approaching the edge of the feasible region, the matrices involved have near zero eigenvalues, 
causing the condition number of the Hessians to diverge. This limits the size of the sys- 
tems that can be studied. Another, completely orthogonal, approach is the boundary point 
method, developed in [591 160] . where one remains on the hypersurface defined by the com- 



plementary slackness condition (3.17), and moves towards the feasible region. This method 



was developed for problems where the number of variables is so large that standard SDP 
algorithms don't work anymore. An implementation of this algorithm for variational density 
matrix optimization was presented in |31j . 



3.5.1 The augmented Lagrangian 

The boundary point method is actually an instance of the more general class of augmented 
Lagrangian approaches for solving convex optimization problems. The standard Lagrangian 
for the SDP: 

min^7i/i* u.c.t. Z = u° + ^7in\ (3.181) 



introduces the Lagrange multiplier matrix X 

L{j,Z;X) = j'^h + TT 



xlz-u^-Y,7^A 



The augmented Lagrangian for Eq (3.181) adds a quadratic penalty for infeasibility: 



u 



+ ^ii^--°-E^^' 



.«l|2 



(3.182) 



(3.183) 
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where a parameter o" > determines how strong the penalty is. If we introduce the new 
matrix variable: 

Wij) = u° + y ^iu' - -X , (3.184) 



a 



we can rewrite the augmented Lagrangian as: 



L^{j,Z;X) = f{j,Z)-—\\Xf 



with 



f{^,Z) = Yh+-\\Z-W{j)r . 



(3.185) 



(3.186) 



The idea of augmented Lagrangian methods for solving SDP's is to perform the optimization 
in two stages: first minimize /{■y, Z) under the constraint Z ^ 0, keeping X constant; then 
update X in some way until convergence is reached. 



3.5.2 Solution to the inner problem 

The inner problem is a quadratic SDP: 

min/(7,Z) u.c.t. Z^O 

for which the Lagrangian reads as: 

L{^,Z;V) = f{^,Z)-TTVZ , 



(3.187) 



(3.188) 



with Lagrange multiplier V for the constraint Z y 0. The optimal point has to satisfy the 
conditions: 






W - aTr 



Z - n° - ^ 7ju^' + -X 



0, 



VzL = a[Z - W{-f)] - y = , 

Z h Q , ^^0, ZV = Q . 



(3.189) 

(3.190) 
(3.191) 



Keeping Z constant, the optimality condition for 7 (3.189) can be fulfilled by solving the 
linear system: 



^Tr [^V]7, 



h' 



Tr 



u'[Z-u^ + -X 
a 



(3.192) 



It is clear that (3.189) holds when the sum of the primal and the dual equality constraints 
are satisfied: 



Tr [Xu'] +Z = h^ + u^ + ^ 



liU 



(3.193) 



so the solution of the linear system (3.192) can be seen as a projection on the feasible plane. 



If, on the other hand, one keeps 7 constant, the optimization of (3.186) under the condition 



that Z ^ is just a projection on the cone of semidefinite matrices. Decomposing 1^(7) into 
a positive and negative parl|^ as: 



W{^) = W{^)+ + W{^)^ 



(3.194) 



''The positive part of a symmetric matrix with spectral decomposition A = ^^ \iXiXf is obtained by 
restraining the sum to positive eigenvalues A^ only, and vice- versa for the negative part. 
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the optimal Z with constant 7 is: 

Z = W{j)+ . (3.195) 

For V to satisfy the optiniahty conditions, it must be equal to: 

V = -aW{-f)- , (3.196) 

for which it can be seen that the conditions 

V = (j[Z-W{j)] , VZ = and VhO, (3.197) 

are automatically fulfilled. Projecting on the feasible plane, followed by a projection on the 



cone of positive definite matrices, brings us closer to the optimum, as shown in Fig. 3.2 We 



can repeat this procedure until convergence is reached, i.e. until the linear equations (3.192) 



are satisfied after projection on the positive cone, to within the prescribed accuracy e. This 
convergence criterion: 

h' - aTi [u' {Z - W{-f))] < e , (3.198) 



can be rewritten using Eq. (3.197) as: 

h' - aTv [u'V] < e 



(3.199) 



As a result, the Lagrange multiplier V is approximately primal feasible after the inner loop. 
The obvious update for X in the outer loop is: 



X ^V , 



(3.200) 



and one can consider the inner loop as a projection on primal feasibility. The program is 
finished when both the primal and the dual are feasible; it follows that the convergence 
criterion for the outer loop is dual feasibility within the preset accuracy. 

3.5.3 Outline of the algorithm 

It is important to realize that throughout the program the primal and dual variables are not 
feasible, i.e. that 7 and Z are independent. The algorithm itself is short and simple, and is 
given in pseudocode in Algorithm [2j At first sight the computationally most demanding step 
in the algorithm is the solution of the linear system Eq. (3.192 ). However, it can be seen that 
when rewriting the system in traceless two-particle matrix space, with 7 = YliliP'- 



57 = Pxr 



-iF(^) + j;4(z. 



1^(7). 



(3.201) 



where S is the overlap matrix defined in Section 3.4.3.2 limited to traceless space. The 
generalized Q map transforms traceless matrices into traceless matrices. This implies that 
the inverse of the overlap matrix limited to traceless space is exactly the same as the inverse 
of the overlap matrix on full space. The system can thus be solved easily by applying the 



inverse overlap matrix, as defined in Section 3.4.3.2 to the right-hand side of Eq. (3.201). 



The most expensive step in the algorithm is the separation of the matrix W into a positive 
and negative part, for which an eigenvalue decomposition is necessary which scales as 0(71"^) 
in the dimension of the matrix. If only two-positivity conditions are applied this leads to a 
scaling of M^ per iteration, for the three- index conditions the scaling per iteration is M^. 
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Figure 3.2: A schematic representation of how the optimal point is reached by subsequent 
projections on the feasible plane and the semidefinite cone. 



Algorithm 2 The boundary point algorithm 



Choose ein > , eout > and a > 
X° = 0; Z° = 0; A; = 
while 6out > eout do 
while 6in > ein do 

Solve Tt [u'u^] -ff^ = -^ + Tr [u' {Z - u'^ + ^X)] for j^^^ 



5in = Ei V(Tr [n^FW] - h^y 



end w^hile 

X(k+1) = yik) 

k ^ k + l 

6out = WZ^"^ - u^ 
end ^vhile 



Ei liu' 



> primal infeasibility 



> dual infeasibility 
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3.5.4 Adding linear equality and inequality constraints 

The addition of linear inequality constraints to the boundary point method is quite analogous 
to the primal-dual algorithm. The only modification is that the overlap matrix takes on the 



form (3.160). For the solution of the linear system (3.201) we now let the inverse (3.175) 
act on the right-hand side of Eq. (3.201), in which evidently the constraint matrices Cf are 



replaced by their projection on traceless matrix space. 

For the equality constraints this trick no longer works, and we will have to solve the 
linear system (3.201) using the conjugate gradient method. The overlap-matrix map which 



repeatedly acts on the right-hand side of Eq. (3.201) is defined as: 



5 (T). 



a/3;7(5 



Pf 



-C [5a-yTps — 5p.yTa5 — ^aS^ ji^i + ^pS^a-y)] 



(3.202) 



in which the factors (a, 6, c) are determined by the applied constraints, and Pf is given by 
Eq. ([3l86|. 



3.6 Computational performance of the methods 

A general comparison between different algorithms is difficult, as the performance of the 
algorithms can depend on the specific system being studied. There are also ways in which 
an algorithm can be tweaked to converge faster, by making adjustments which are sometimes 
logical, and sometimes don't make sense. A comparison of different algorithms is therefore 
inherently biased towards the algorithm one spent most time optimizing. This being said, 
this Section contains an honest comparison of the various algorithms developed and tested 
during the course of my PhD, and an attempt to explain in some detail what the bottlenecks 
are in the different methods. 



3.6.1 Interior point methods 

As mentioned earlier, for interior point methods the bottleneck is the solution to the Newton- 
like Hessian equation that has to be solved at every iteration: 



W^5j 



(3.203) 



The dimension of the system is the number of free variables in the optimization, which scales 
as the number of elements in a 2DM, being M^. General methods to solve linear systems 
{e.g. using LU or Cholesky decomposition) scale as 0{n^)., which means an algorithm based 
on a direct solution of Eq. (3.203) scales as M^^. An explicit construction of the Hessian 



matrix would be even more demanding, as for every element some elementary M^ matrix 
calculation has to be done, resulting in a scaling of M^'^. This huge computational cost 
can be avoided by referencing the Hessian matrix only through its action on a 2DM and 
resorting to iterative methods, like the linear conjugate gradient method, to solve the system 



(3.203). At every iteration of the conjugate gradient method, a matrix-vector product has to 



be performed, which scales as M^ for two-index conditions and M^ for three-index conditions. 
The drawback to this iterative approach is that the efficiency of the solution, i.e. the number 
of iterations needed to converge, depends on the condition number of the Hessian. For interior 
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Figure 3.3: The number of iterations needed by the hnear conjugate gradient method to solve the 
inner Newton problem as a function of the primal-dual gap, for both interior point methods, applied 
to a 10 site ID-Hubbard model with 10 particles. 



point methods, the condition number diverges near the edge of the feasible region, while a 
point near to the edge is reached quite rapidly, gaining those few extra digits in precision 



becomes very demanding. In Figure 3.3 we have plotted the number of iterations needed for 
the linear conjugate gradient method to converge to the desired accuracy as a function of the 
primal-dual gap, for a 10-site one-dimensional Hubbard model with 10 particles. One can see 
that, for both the primal-dual and the dual-only method, the number of iterations increases 
drastically when the primal-dual gap decreases. For the dual-only method, a free parameter 
that can be tweaked to optimize the total number of iterations, is the factor by which the 



potential parameter t (see Eq. (3.46)) is scaled down after every Newton loop. If we choose 
this factor to be large, many more Newton steps are needed to converge to the optimum of 



the next t value, and the number of iterations in Figure 3.3 is much higher. In the primal-dual 



method, how much the primal-dual gap is reduced per Newton step is determined by how 
far one can step in the predictor direction without exceeding the maximal deviation from the 
central path (See Eq. ( 3.157| )). One can see in Figure 3.3 that there are more green points 
than red points, which means that there are more Newton steps needed in the dual-only 
method than in the primal-dual. In general the primal-dual method will require less Newton 
steps to converge than the dual-only approach, due to the fact that information of both the 
primal and the dual problem are used to determine the predictor direction. 
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Figure 3.4: The evolution of primal and dual infeasibility during the convergence of the boundary 
point method. 



3.6.2 Boundary point method 



The boundary point method is in its approach completely orthogonal to the interior point 
methods, and it therefore has very different numerical properties and difficulties. A key 
advantage is the absence of a divergence of the condition number of some linear system. 
A disadvantage is that the method is much less stable, and its convergence properties are 
more dependent on the systems studied than for interior point methods. An important 
free parameter in this algorithm is the prefactor a to the dual infeasibility penalty in the 
augmented Lagrangian (See Eq. (3.183)). This parameter has to be chosen carefully and 
updated during the algorithm to ensure balance between primal and dual infeasibility. In 
theory one lets the inner loop converge to a certain precision ein- In practice the algorithm 
turns out to work best when only one inner iteration is performed. After every outer iteration 
the primal and dual infeasibility are checked. If the dual infeasibility is larger than the primal 
infeasibility, a is multiplied by some factor r > 1. If the dual infeasibility is smaller then the 



primal, a is divided by r. In Figure 3.4 the convergence behaviour for the boundary point 
method is shown. It is seen that there are two different regimes during convergence. First 
the primal and dual infeasibility are not balanced and they do not decrease monotonically, 
but rather oscillate in a chaotic manner. Once they have become balanced, the convergence 
towards the feasible area proceeds in a more stable, monotonic way. 
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3.6.3 Comparison of scaling properties 

To conlude the Chapter, some scahng properties of the different methods are compared, 
when apphed to the one-dimensional Hubbard model. All the methods have the same basic 
computational scaling behaviour, being O(M^) required for multiplying or diagonalizing a 



matrix. In Figure 3.5 the number of these operations needed to converge to the optimum 
is plotted as a function of lattice size. The interior point methods both have to solve a 
linear system of size M^, so it is not surprising that the scaling, on top of the basic matrix 
computations, of these methods is M"^. More surprising is that there seems to be no, or 
a very limited, scaling for the boundary point method. The number of iterations required 
remains around 3000, irrespective of the size of the system. It must be stressed that this is 
a result limited to the one-dimensional hubbard model, and cannot be extrapolated to other 
systems, like molecules, where the convergence properties of the boundary point method can 
be completely different. One reason for the succes of the boundary point method applied to 
the Hubbard model is the amount of symmetry present in the system. The boundary point 
method is designed for problems with a huge amount of dual variables or primal constraints. 
For most physical systems the dimensions of the matrices involved are already unfeasibly 
large before the boundary point method would becomes advantageous. The Hubbard model, 
however, contains many symmetries (see Chapter El), implying that the matrix dimensions are 
considerably reduced, and the number of dual variables can get very large before the matrix 
computations involved become unfeasible. In this case, the domain where the boundary point 
method is advantageous is actually reached. 
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Figure 3.5: The number of basic matrix operations needed to converge for different lattice sizes of 
the one-dimensional Hubbard model. 



CHAPTER 4 



Symmetry adaptation of the 2DM 



In the previous Chapter we have shown how the variational determination of the density ma- 
trix can be formulated as a semidefinite program. The current implementations of semidefinite 
programming algorithms, however, are not yet competitive with other many-body methods 
of comparable accuracy (like the coupled-cluster framework with single and double excita- 
tions). In this Chapter it is shown that including symmetry in the v2DM algorithm is a 
straightforward, though sometimes tedious, task, and leads to a considerable speedup of all 
computations. A system is said to have a certain symmetry when it is invariant under the 
action of some operator O. In quantum mechanics, this is expressed mathematically by the 
fact that the Hamiltonian operator describing the system commutes with the operator O: 

[i^,O] = 0. (4.1) 

As we will see, this implies a partial (block)-diagonalization of all matrix quantities involved. 
In this Chapter an overview is given of the different symmetries occurring in the systems 
under study, and how these were exploited. The first Section deals with the inclusion of spin 
symmetry, which is present in all the atomic, molecular and lattice systems that have been 
considered. Then additional symmetries are discussed that are more specific to the different 
systems, for instance the symmetries present in atomic systems (rotational symmetry and 
parity). The last Section deals with the symmetries in the one-dimensional Hubbard model, 
i.e. translational invariance combined with spin and parity. For a more complete account 
of symmetries and group theory in physics we refer to |61l I62j : for a basic overview of the 
angular momentum algebra used here, see Appendix [B| 

4.1 Spin symmetry 

Spin symmetry is one instance of the much larger class of symmetries arising from the in- 
variance of a system under rotations, either in real space or in some abstract spin space. 
As is known from basic group theory [61], the generators of a rotational group are angular 
momentum operators {Jx-,Jy,Jz) which satisfy the Lie algebra: 
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The invariance of a Haniiltonian under a rotation then translates into the conservation of 
angular momentum: 

[H,J,]=0, [H,J'] = 0, (4.3) 

where 

J^ = J^ + Jl + Jl . (4.4) 

From Eq. ( 4.3 ) and the fact that [J'z , i7^] = it follows that the eigenstates of the Hamiltonian 



can be labeled by the eigenvalues of J' and J'^: 

-Jm) = ^JM\'^JM 



H\^>^j^) = E^^\^^j^) , (4.5) 



with 



r\nM) = JiJ+^W}M), (4.6) 

jm) = M^JM 



J.I^^m) = M\^^m) ■ (4-7) 



We now expose how this symmetry can be used to speed up the algorithm by transforming 
the 2DM and its matrix maps into block-diagonal matrices. 

The most frequently encountered symmetry of this kind is the symmetry under rotations 
in spin space. Physically this can be interpreted as the absence of a preferred direction for 
the axis of spin quantization. All atomic, molecular and lattice systems that were studied in 
this thesis have this symmetry. This symmetry would be broken if e.g. an external magnetic 
field is added to the Hamiltonian. We rewrite the single-particle basis to explicitly introduce 
the electron spin: 

\a) -^ \aaa) , (4.8) 

in which a is a spatial orbital index and aa = ±2 is the spin projection. To exploit this 
symmetry we transform the antisymmetric two-fermion basis, in which the 2DM is expressed, 
to a spin-coupled basis (See Appendix [B] or p3|): 



\ab;SMs) = 5^(^aa^^6|SM5)|aaa6afe) . (4.9) 

Two spin-^ particles can couple to a spin 5 = singlet and a 5 = 1 triplet: 



\ab;00) = ^[|atH)-|aiH)] , (4.10) 

|o6;l-l) = |at6t) , (4.11) 

\ab;lO) = ^[|atH) + |ai6t>] , (4.12) 

\ab;l + l) = \albl) , (4.13) 

which are respectively antisymmetric and symmetric in the spin-coordinates. Combining this 
with the permutation symmetry of the electrons the norm of Eq. (14.9^ is not unity but: 



{ab; SMs\cd; S' M's) = Sss'hisM', [^ac^u + {-if^adhc) , (4.14) 

which means we have to define the spin-coupled (see Appendix [BJ) two-particle creation op- 
erators B^ as: 

^^ k ^ 41!, = -7^ Y.(l^-l''b\SMs)al^al , (4.15) 



B^T 
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where (06) is shorthand for (1 + dab) (see Appendix [A]) . In this spin-coupled basis, the 2DM 
for an ensemble of states with spin S and spin projection 5^ = A^ is given by: 



^ab;cd - Z^Wi\'^SM,i\^ ah ^cd \^ ■ 



SM.tl 



(4.16) 



For the following it is convenient to rewrite Eq. (4.16) in a form where the direct product 



of the operators B'^ and B is coupled to a spherical tensor operator with total spin St (see 
Appendix |B]) : 

SM^SMs,s'M^ = {-lf-''sY,w.Y.^SMsS'-M's\STMT) 

i StMt 



(^ 



N I 
SM,i\ 



St 



ab 






St 

Mt 






(4.17) 



As the bra and ket wave functions in the ensemble have the same z-component of total spin 
M, the matrixelement in the above equations can only be non-zero if Mt = 0, which means 
that Ms = Mg-. 



ab:cd 



JMsM' 



_^f-Ms Y^ ^^ Y^iSMsS' - Ms\StO) 



St 



{^SM,\ [ 



B^ab' 



Bed 



1^ 



N X 
SM,il 



(4.18) 



For a further reduction of the 2DM we first consider the case where the total spin of the wave 
function is zero. 



4.1.1 Singlet ground state 



When the ground state is a spin singlet (5 = 7W = 0), the operator in Eq. (4.18) has to be a 
singlet too {St = 0), for the matrixelement to be non-zero. It follows that S = S' and: 



00 SMs;S'M's 
abicd 



->MsM' 



6ss'{-lf~^'' Yl MSMsS - M5IOO) 



The Clebsch-Gordan coefficient 

{SMsS - Ms\00) 



(* 

(_1)S-Ms 

Is] 



N I 
00,il 



fit 



ab ' 



implies that Eq. (4.19) can be written as: 

■,S'M' ^MsM'g^SS' 



00-pSMs 
abicd 



[S] 



^Wii^ 



fi 



cd 



^^I,.) • (4.19) 



(4.20) 



N I 
00,il 



fit 



ab 



B: 



cd 



1^ 



N y 
00,i/ ' 



(4.21) 



which is independent of Ms. As a result there is a decomposition of the global 2DM into four 
diagonal blocks, one block with 5 = 0, and three with S = 1. Those with S = 1 are identical, 
so all matrix manipulations can be restriced to one copy. It follows that the basic object that 
has to be stored can be written as: 



00-r.SMs;S'M'g 
abicd 



p5 

-"- ab;cd ' 



(4.22) 



as is illustrated in Figure 4.1 
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4.1.2 Higher-spin ground state 

It turns out that such an economical block decomposition is also possible for higher-spin 
ground states, provided we consider spin-averaged ensembles: 



S SMs;S'M'g _ 
^ab;cd - '^MsM'gK 



-1 



sS'-Ms 



Y,w.^{SMsS' - Ms\StQ) 



[S] 



'^ M 



St 



I* 



N V 
SM,il ' 



(4.23) 



in which an equal weight ensemble is taken over the different members of a multiplet l^^^vj i)- 
We are still able to reach the lowest energy by considering such ensembles, because the differ- 
ent members of a multiplet are degenerate when the Hamiltonian has spin symmetry. Using 



the Wigner-Eckart theorem (B.19) we can rewrite the coupled matrixelement in Eq. (4.23) 
as: 



(*, 



N I 
SM,i\ 



i?t 



S 
ab 



B. 



\^SM, 



(-1) 



S-M 



(^ 



S S St 
M -M 

s 



N I 
S,i\ 



B\b^B^d 



St 



1^ 



N X 
S,i) 



Substituting 



(-1) 



S-M 



[S] 



S S 
M -M 



the sum over Ai can be used in Eq. (4.23), together with the orthogonality relation: 

M ^ 
to obtain: 



0\ ( S S St 
M -M oj \M -M 



hj 







S^SMs;S'M'g 
ab;cd 



1 



^MsM's^SS'JgVF^^Y.'^^ ^^s,i\\ [BKb^B, 



)S 







1^ 



N X 
S,i) 



(4.24) 
(4.25) 

(4.26) 

(4.27) 



Eq. (4.27) implies that by taking the spin-averaged ensemble, an expression for the 2DM is 



obtained which is diagonal in the two-particle spin S, and independent of the third component 
M. It follows that the 2DM has the same block reduction for higher-spin states as for the 
singlet, illustrated in Figure [4Tj 

To summarize, the spin-coupled 2DM, for all spin states S, is related to the uncoupled 
2DM as: 



p5 



ab\cd 



Y.T.(l''-l''b\SM){laclad\SM)T^a.^)^,,Mc..)ida,), (4.28) 



J(ab)(cd) ^ ^ 



with the inverse transformation: 



■^,1 1 



1 1 



(a(Ta)(6o-i,);(c(Tc){Q!o-d) 



^fiMcd)Y,{-a^-a^\SM){-a,-a,\SM)V%^ . (4.29) 



SM 



2 2 



Note that the symmetry under the exchange of single-particle indices in the 2DM is now: 

^ab\cd = (~1) ^ba\cd = (~1) ^ab\dc — ^baidc ) (4.30) 
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No symmetry 



Spin symmetry 




s = 1 



Figure 4.1: Illustration of how spin symmetry of the Hamiltonian induces a block decomposition of 
the 2DM. 



68 



CHAPTER 4. SYMMETRY ADAPTATION OF THE 2DM 



because of the extra phase (—1)^"'^ that arises when the spins are exchanged in a Clebsch- 
Gordan coefficient. The S = block has dimension ^(^ + 1) as it is symmetrical in the 
spatial single-particle indices. The three identical 5 = 1 blocks are antisymmetrical in spatial 
single-particle indices, and therefore have dimension -^{^ — !)• This sums up nicely to the 
dimension of the full 2DM: 



M(M-l) M (M 



+ 1 +3 



M fM 

T T-' 



4.1.3 Decomposition of the two-index constraints 



(4.31) 



All linear matrix maps defined in Section 2.2 have a similar decomposition in block-diagonal 
form. In this Section the spin-coupled form of the matrix maps is introduced. The IDM for 
spin-symmetrical systems is defined as: 



P{aaa){bai,) 



^oa^yal = ^'yaOi^Wi^^^^^M^^ai^^a^^^Y^^M^ 



(4.32) 



M 



from which it is clear that the IDM is reduced to two blocks pj^^: one for spin-up and one 
for spin-down particles. In the same way as for the 2DM it can be shown that by taking the 
spin-averaged ensemble the depence on a vanishes, which means the up and down spin blocks 
are identical. The IDM is derived from the 2DM as: 



N 



bEr 



(a<7)/3;(ca)/3 • 



(4.33) 



By substituting the inverse transformation (4.29) and performing elementary angular mo- 

(4.34) 



mentum recoupling the IDM can be derived from the spin-coupled 2DM as: 

Pac 

i\ — \ ■ _ 

S b 



^E^E^/(^^^^M^f,,, 



The Q condition: the spin-coupled Q condition is defined as: 



[S]' 



N y 
SM,il 1 



(4.35) 



M 



in which the tensor operator B'^ is again given by Eq. (4.15). In exactly the same way as 



for the 2DM the Q map decomposes into a singlet block and three identical triplet blocks. 
To derive the spin-coupled form of the Q map we have to substitute the uncoupled form 



Eq. (2.33) into: 



E ^'^n'^'^^'^b\SM){-(Tc-(Td\SM)Q{T)^^^„^)Q,„^).^^^^),^dai) , (4.36) 



\/P)M)" 



2(r)a6;cd 

and carry out straightforward angular momentum recoupling. Following this procedure leads 
us to the spin-coupled expression of the Q map: 

1 ,. . , . c. . . X 2Tr r 



Q(r)ffe;ed 



1 



{Pac^hd + (-1) ^ad^h^ 



N{N -\) 



+ ^ah;cd 



Viab)icd) 



{^acPbd + {-l)^SadPbc + i-l)^hcPad + SbdPac) ■ (4.37) 
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The condition: the spin recoupling of the Q condition is a bit more involved. As ex- 
plained in Appendix iBl the correct spherical tensor form of a particle- hole operator is: 



^t 



ab 



a^fS" (ib 



with aba, = i-l)^+'''ab-a, ■ (4.38) 

With this operator, the spin-coupled Q map can be defined through a spin-averaged ensemble: 

G{n!b;cd = E^*RJ2 Y^^^SMM^lb a!Mm,) , (4.39) 



[Sf 



M 



which again decomposes into four blocks, one singlet and three identical triplet blocks. The 



expression of Eq. (4.39) in terms of the spin-coupled 2DM is found by substituting Eq. (2.38) 
into: 



0{T)ab:cd 



EE(-i)^-^"-^^4-4 



(Tb\SM) 



0"aO"i, CTcCTtj 



,1 1 



(t^O-c- -ad\SM) G{T)^aaa)(baMca,)(dcTa) ■ 



'2 2 



(4.40) 



The first term of Eq. (2.38), containing the IDM, is easy to recouple. For the second term, 



however, the recoupling formula Eq. (B.14) is needed, which introduces a Wigner-6j symbol 
into the coupled form of G map: 



G{rfab;cd = hdPac " THMEE^'I 



2 J 2 2 ^ Ky^' 



1 1 qi ( ad\cb 

2 2*-^ 



(4.41) 



As there is no symmetry related to the spatial single-particle indices of the Q matrix the four 

3 e 
matrix, i.e. M^. 



blocks all have equal dimension —^ , summing up to the total dimension of the uncoupled Q 



4.1.4 Decomposition of the three-index constraints 

The spin coupling of the three-index conditions is much more complicated than for the two- 
index conditions. There are several ways to couple three spin-g particles to a good total 
spin, as explained in Appendix |B| depending on which two particles are coupled to good 
intermediate spin. We have chosen to first couple aaa and bab to intermediate spin Sab = 0/1, 
and then couple cac with Sab to total spin S = \l\'- 

\ahc;{Sab)SM) = Y. Y. {i2^a-(Tb\SabMab){SabMab-(7c\SM)\aaababcac) . (4.42) 

The choice for a complete orthogonal basisset of three-particle space is more subtle than 



before, and shown in Table 4.1 That this is a correct choice is seen from the fact that 



the sum of the dimensions of the blocks, multiplied by their degeneracy, equals the total 
dimension of uncoupled three-particle space — - — ^-^ — ^^. Once again a norm yj{ab) has 
to be added because of the symmetry of the Clebsch-Gordan coefficients, which means a 
normalized three-particle operator is defined as: 



i?t 



s 

ab{Sab)c 



PtyE E {l^alab\SabMab){SabMabl(Tc\SM)ai^^ 



(«fe) " 



"L,«L 



(4.43) 



aCTb MabCTc 



70 



CHAPTER 4. SYMMETRY ADAPTATION OF THE 2DM 



Table 4.1: A complete spin-coupled basis of three-particle space. 



s 


Sab 


orbit als 


dimension 


degeneracy 


1 

2 
1 
2 
1 
2 
3 
2 





1 
1 


a = b y^ c 
a <b < c 
a <b < c 
a <b < c 


f(f-l) 

M(M_l)(M_2) 

^(f-l)(f-2) 
^(f-l)(f-2) 


2 
2 
2 

4 



There is another (technical) complication that arises from the specific choice of the basis as 



in Table 4.1 For a 2DM the elements are stored in an ordered basis a < b. If we want to 



access a matrix element where the order is reversed, we can use the simple relation: 



T.5 

ab:cd 



-^f^Lcd ■ (4.44) 



For a three-particle basis, however, this is much more complicated. An exchange of the first 
two indices is still simple: 

\abc; {Sab)SM) = {-lf^^\bac; {Sab)SM) , (4.45) 

but the symmetry between the first two and the third index is more involved. Suppose access 



is required to an element abc with c < b < a, then the recoupling formula Eq. (B.ll) has to 
be used: 



\abc; {Sab)SM) 



'{cbl 
(ab) 



[Sab] /_^[S, 



Cb\<i 1 1 q"'' } |C^"; {Scb)SM) , 

9 iJab , 



(4.46) 



to express it as a function of basis states that are stored. 



The 7i condition: the spin-coupled 7i condition is defined as: 



'IV-L )abc-Aez 



2Z ^*T512 5Z \^'^SMAB\h{Sab)c ^i, 



de{Si,)z\'^SM,i 



^ 



N 



+{^';i ,, ^B±,. ^ ,„ bKu^ .^^■^ 



1^ I R-S 

'SM,i\^de{Sd^)2 



N 



ab{Sab)c\^SM,i/ ' 



(4.47) 



with B' given by Eq. (4.43). In an analogous way as for the two-index constraints, it can be 
shown that this expression is diagonal in SM and independent of M. It follows that the 71 
matrix is reduced to two identical S = ^ blocks and four identical S = ^ blocks. One can 



express Eq. (4.47) as a function of the spin-coupled 2DM by substituting (2.46) in: 

1 



S{Sab',Sde) 



'n'- labcidez 



, , ^.,, . Y. Y. ^l,^a\<yb\SabMab){SabMab\ac\SM) 

Y. Y {:^^d^^e\SdeMde){SdeMde-a,\SM) 

O-dCTe MdeO-z 

'Iv-'- ){acra)ibab){ccrc);{da-d){ecre){zcr2) ■ 



(4.48) 
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This leads to some tedious but straightforward angular momentum recoupling which results 
in the final spin-coupled expression: 



TiiT) 



abc\dez 



2Tr r 

N{N - 1) 

Viab){de) 

[Sab] [Sde] 



S„b 
de 



\'{a.o){de) 

[5cz {SbePad + {-'i-f^'Saepbd + {-if'^'^dPae + ^adPbe) 



+ {Sadhe + (-l)^'"''^M<Jae) pcz] 
I , ,,,, , ,1 ! n^\ {^az^cdPbe + ^-^f "^ ^ az^ ecPbd + {-^f^'hz^cdpae 

■^{-^f^'^^^^^bz^ecPad + (<56e'^cd + {-if '^ ^bd^ ce) Paz 
+ (-1)^"'' {^ae^cd + {.-if^^^ce^ad) Pbz + {^az^be + {-if^'^z^ae) Pad 

+ {-'lf''{^az6bd + {-lf-''5bz5ad)Pce] 



+ [Sab][Sde] 



S 2 Sab 
lie 
2 2 "^d-^ 



\bz) 



[Sab] [Sde] V^ r o/n 



Q/ ^ f Q 1 C«/ 

Olio c) o 






'(ac). 
(a6)- 



ac;de 






e^p5„, 



a6;2e 



Q 1 

•^ 2 

1 1 

2 2 



•Jab 



lie 
2 2 '-'dii 



SadVibc){ez)Tte 



+6Mi-lf'^'Vi^cJW)rf,.ez + 6aei-lf'^V{bcJW)^!c, 
+SU-lf'''^''^VM{dz)ra'e;dz 



dz 



(4.49) 



The 72 condition: for the spin-coupled 72 condition a good spherical tensor operator is 
needed which creates two particles and one hole, 



fit 



5 
ab{Sab)c 



7^ X] Yl i2^CL^'^b[SabMab){SabMab-Crc[SM)al^^al^^dca, 



^(«ft) .1 



O-j, MabO-c 



(4.50) 



Note the presence of the a operator, similar to Eq. (4.38). A choice for the orbital ordering 



which generates a complete basisset for spin-coupled two-particle-one-hole space is shown in 



Table |4.2[ In contrast to the three-particle case, there is no recoupling needed to access 
elements of the form c < b < a since there is no symmetry involving the third index. A 
simple addition of the dimensions multiplied with their respective degeneracies shows that 



this basis spans the same dimension as the uncoupled particle- hole basis, i.e. 



M^M-l) 



. Using 



the operator defined in Eq. (4.50) one can proceed to define the spin-coupled 72 condition as: 



'r2(r) 



S{Sab\Sde) 

abc\dez 



5Z ^*Tc]2 5Z \^'^SMAB\h{Sab)c ^ii 
i ^ ^ M 



de(Sd^)z\^SM,i 



^ 



N 



+{^'^ ,, ^B±r. ^ .. bKu^ .,.\^ 



1^ I rS 

'SM,i\^de{Sd^)z 



ab{Sab)c\^ SM,ii 



(4.51) 



72 



CHAPTER 4. SYMMETRY ADAPTATION OF THE 2DM 



Table 4.2: A complete spin-coupled basis of two-particle-one hole space. 



S Sab sp-coords dimension degeneracy 



1 

2 

1 

2 
3 
2 





1 
1 



a<b,c ^(f + 1) 
a<b,c ^(f-1) 



a 



2 

4 



One obtains the spin-coupled form of the Ti map by substituting Eq. (|2.49|) into: 

1 



'21-L )abc:dez 



yj{ah){de 



■EE 

CaCFh MabO-c 

EE 

CTdCTe MaeO-z 



-lf^~'''{l<^a\(7b\SabMab){SabMabl - CJe|5M) 

-l)'^-''^{lad^ae\SdeMde){SdeMde^ - a,\SM) 



'2\l- ){acra){bcri,){cac);{d(7d)iecre)(zcrz) • 



(4.52) 



Notice the extra phases and minus signs arising from the use of a. In the same way as for 
the 7i matrix, the Ti matrix falls apart in six diagonal blocks, two identical spin- 2 and four 



identical spin-| blocks. The first two terms in Eq. (2.49) are of a type that we encountered 



in previous calculations. The last four terms (containing a 2DM) are a bit more involved. 



First we have to use Eq. (B.14) three times, generating three Wigner-6j symbols which can 



be reduced to one Wigner-9j symbol using Eq. (B.16). This leads to the following final 
spin-coupled expression for the Ti map: 



[Sab] [Sde 




/rTTTTTT^.L- J ■» 2 - 2 ( \"""\/(ce)(Mrfe'; 
^y{ab){de) ^ 

+{-^f^'SbdV{ce){za)Tt,.a + {-lf''^daeVicd){zb)Ti;,.,, 



zb 



(4.53) 



The T2 condition: the spin-coupled T2 condition is different from the other constraints, in 
that the sum of two spins appears in the operator that generates the constraint: 



-" ab{Sab)c,m 




+ al 



(4.54) 



M 



The extra one-particle term only contributes to the result when the two-particle-one-hole 
operator couples to 5 = ^; otherwise the spin-averaged ensemble is zero because of spin 
conservation. This means that only the S = 2 part of the 72 matrix is different from the 
regular Ti . Performing the angular momentum recoupling yields the spin-coupled form of the 
T2 condition: 

I '^{^)abc;dez ^- 

i{Sde) 

m:dez 



-T-//-p\ 2 i^ablSde) 

'2\ J abc,m;dez ,n 



\ 




(4.55) 
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with io defined as: 



/^""^ =^L= Y, Yl i-^)''~'''{l^al(Tb\SabMab){SabMabl _, ^ 






=^(-i)'+^"^A/Mrf-.- (4.56) 

The 5" = I block of To is identical to the that in Ti: 



2 

^^(nlSiL = T^inl^li!. • (4.57) 

4.2 Symmetry in atomic systems 

The atomic Hamiltonian in the Born-Oppenheimer approximation consists of a single-particle 
term, including the kinetic energy of the electrons and their attractive interaction with the 
positive central charge Z, and a two-particle term describing the Coulombic repulsion between 
the electrons: 

in which atomic units have been used. Atomic systems are highly symmetric: the spin sym- 
metry explained in the previous Section is present, but also symmetry under spatial rotations 
and under a reflection through the origin. The combined symmetries can be exploited to 
obtain a very large reduction of the dimensions of the blocks in all the matrices involved in 
the SDP, which leads to a huge speedup of the program. 

4.2.1 Symmetry under spatial rotations 

Symmetry under spatial rotations is of the same type as symmetry under spin rotations, 
and leads to the introduction of (orbital) angular momentum. The single-particle basis is 
transformed to explicity introduce the extra angular momentum quantum number: 

\a) -^ \amaaa) -^ UnaDrriaCra) . (4.59) 

The spatial orbitals a are labeled by a main quantum number n, and an angular momentum 
/. In the discussion of spin symmetry for spin- 2 electrons, the total two-particle spin was 
limited to 5 = and 5 = 1. For angular momentum, depending on the basis used, the orbital 
angular momentum, /, takes on different integer values, corresponding to the s,p,d, f,g, . . . 
orbitals. This implies that the two-particle angular momentum L can have many different 
values. A two-particle state in spin and angular momentum coupled form is defined through 
the relation: 

\ab-LMLSMs) = Y Yl {^<^a^'^b\SMs){lamJbmb\LML)\ama(Jabmbab) . (4.60) 

The norm of such a state is not unity: 

{ab; LMLSMS\cd; L' M'l^S' M's) = Sss'^lu^MlM'^^MsM'^ 



(<5aAd + (-l)^+^+'"+'''5aAc) , (4.61) 
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implying that the normaUzed two-particle creation operators, which form the basis for ex- 
pressing the 2DM, are defined as: 

B^tb = -^ yZ yZ &al<r,\SMs){lamJ,m,\LML)ai^^^^al^^^^ . (4.62) 






Note that the symmetry of a two-particle state under the exchange of the two spatial indices a 
and 6, depends on L, S and the single-particle angular momenta la and lb- This is a problem 
because, in a block with two-particle quantum numbers LS, the single-particle indices can 
have a different permutation symmetry. This is solved by the introduction of the parity 
quantum number in the next Section. 



4.2.2 Reflection symmetry and parity 



H,P 



where P 



The eigenstates of a Hamiltonian symmetric under spatial inversion 

changes r in — r) can be chosen as simultaneous eigenstates of the inversion operator. Since 
P'^ = 1, P can have only two distinct eigenvalues: 

Pl^'^) = 7r|^'') , (4.63) 

where vr = ±1 is called the parity of the state \^'^). 

The single-particle states in a spherical basis can be decomposed in a radial part fni{f)-, 
and an angular dependend part, described by a spherical harmonic Yim{0, (j)). Using the parity 
property of spherical harmonics, we have that: 

PYUe,(l)) = {-iyYUe,ct)) . (4.64) 

It follows that the spin and angular momentum coupled two-particle states introduced in 



Eq. (4.61) have good parity: 

P\ab;LMLSMs) = (-l)'"+'^|a6; LMl^Ms) . (4.65) 

This solves the problem with permutation symmetry mentioned in the previous Section, since 
the exchange of the spatial orbitals can now be written in terms of the two-particle block 
coordinates L, S and vr: 

\ab; LMlSMstt) = 7r(-l)^+^|6a; LMlSMstt) . (4.66) 

4.2.3 The 2DM for atomic systems 

All symmetries present in an atomic system can be exploited by considering a spin and angular 
momentum averaged ensemble: 

i ^'^J ^^J MsMc 



in which the two-particle creation operator B^ is given by Eq. (4.62). To a block L'^S, only 



those pairs ab contribute for which the angular momenta la and Ih satisfy the triangle relation: 

\la-lb\<L<la + lb, (4.68) 
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and have the correct parity, i.e. 

(-1)'"+''' =7r . (4.69) 

The spin and angular momentum coupled 2DM is related to the uncoupled 2DM through a 
unitary transformation involving four Clebsch-Gordan coefficients: 

^ab;cd = ,, \, ,, Yl E {l^a\<^b\SMs){lamahmh\LML) 



Yl Yl {i^'yci^'yd\SMs){lcmcldmd\LML) 



^2 2 



>- {amaCra){bmi,<Ji,);(cmcCrc){dmacra) • V^- ' Uj 

To appreciate what an enormous reduction is obtained in the dimensions of the 2DM blocks, 



the dimensions and degeneracies of the different blocks are shown in Table 4.3 for the cc- 
pVDZ, cc-pVTZ and cc-pVQZ atomic basis sets [6l] for first row atoms (from Li to Ne). A 
cc-pVDZ basis consists of 28 spinorbitals, 

implying that the dimension of the two-particle space is 378. In the spin and angular mo- 
mentum coupled version of the program we only have to store 14 blocks, the largest of which 
has a dimension of 10. The scaling of the heaviest matrix computations in the program goes 
as 0{n^). If we compare the computational cost a such a computation on an uncoupled 2DM 
with the cost on 2DM in the fully coupled program, we see a increase in efficiency with a 
factor of 20821. The speedup becomes even more spectacular for the larger cc-pVTZ basis, 
which has 60 spinorbitals, 

{lsf{2sf{2pf{3sf{3pf{3d)'^{4sf{Apf{M)'^{4f)'\ 

and for which the dimension of two-particle space is 1770. This reduces to storing 22 blocks, 
the largest of which has a dimension of 21. The reduction factor in the computational cost of 
an elementary matrix computation on a 2DM is 125948. Finally, for the cc-pVQZ basis there 
are 110 spinorbitals 

{ls)\2sf{2pf{3sf{3pf{3d)''{Asf{^pfm'\Af)'\5s)\5pf{5d)'^{5f)'''{5gy\ 

with a two-particle space dimension of 5995, which reduces to 30 blocks with a maximal 
dimension of 46. The reduction factor in the computational cost of an elementary matrix 
computation on a 2DM is 456052. This example shows that symmetry can reduce the com- 
plexity of the problem enormously and should be exploited to the full. 

4.2.4 Two-index constraints for atomic systems 

The IDM for atomic systems using the spin and angular momentum averaged ensemble reads: 

P{ama(Ta);{cmc(7c) =/^'"^i \C]2\r]2 Z^ \^SMsCMc,i\"'ama(Ta^cmcac\'^SMsCMc,i' 
i ^ J ^ J MsMc 
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Table 4.3: Dimensions and degeneracies of the 2DM blocks for first-row atoms (from Li to Ne) in a 
cc-pVDZ, cc-pVTZ and cc-pVQZ basis set. 





dimension 


deg 




dimension 


deg 


(L-5) 


DZ 


TZ 


QZ 




{L-S) 


DZ 


TZ 


QZ 




(0+0) 


10 


20 


35 


1 


(0+1) 


4 


10 


20 


3 


(1+0) 


1 


4 


10 


3 


(1+1) 


4 


10 


20 


9 


(1-0) 


10 


20 


40 


3 


(1-1) 


8 


20 


40 


9 


(2+0) 


7 


21 


46 


5 


(2+1) 


4 


15 


36 


15 


(2-0) 


2 


8 


20 


5 


(2-1) 


2 


8 


20 


15 


(3+0) 





4 


15 


7 


(3+1) 


1 


7 


21 


21 


(3-0) 


2 


12 


34 


7 


(3-1) 


2 


12 


34 


21 


(4+0) 


1 


7 


26 


9 


(4+1) 





4 


20 


27 


(4-0) 





2 


12 


9 


(4-1) 





2 


12 


27 


(5+0) 








4 


11 


(5+1) 





1 


7 


33 


(5-0) 





2 


12 


11 


(5-1) 





2 


12 


33 


(6+0) 





1 


7 


13 


(6+1) 








4 


39 


(6-0) 








2 


13 


(6-1) 








2 


39 


(7+0) 











15 


(7+1) 








1 


45 


(7-0) 








2 


15 


(7-1) 








2 


45 


(8-0) 








1 


17 


(8-1) 
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and is diagonal in a, m and I. A block labeled by single-particle angular momentum I has a 
degeneracy of 2(2/ + 1) and a dimension equal to the number of spatial orbitals (n/) that have 
angular momentum /, e.g. for the cc-pVDZ basis the dimension of the s block equals three, 
since the basis has three types of s orbitals, Is, 2s and 3s. The IDM can be derived from the 



2DM, in a similar fashion as in Eq. (4.34): 



ruaric 



2[lfN 



■K LS b 



S 
d)b;{ncl)b 



(4.72) 



The matrix constraints in spin and angular momentum coupled form are derived through 
procedures that are completely analogous as those discussed for spin symmetry in the pre- 
vious Section. Since two types of recoupling have to be carried through (spin and angular 
momentum), all expressions become a bit more elaborate. For the sake of completeness we 
nevertheless provide them in the remainder of this Section. 



The Q condition: the spin and angular momentum coupled Q condition has the same 



block reduction as the 2DM (as shown in Table 4.3). It is defined through a spin and angular 
momentum averaged ensemble: 



Q(r) 



ab:cd 



E 



1 



Wi 



[sncy- 



E 

MsMc 



,Nn 



{^SMsCMc,i\^ab 



L^'S, 



BK'B^t.\^ 



cd 



,Nn ^ 



(4.73) 
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with B"^ given by (4.62). Expressed as a function of the 2DM this becomes: 

1 



^^^^'^'^^^ 7P)M) 



{Sac6M + 7r{-l)^+^6ad6tc) J?^ + rf^ 



1 



Viab){cd) 



N{N -1) 



ab\cd 



nyrid 



+vr(-l 



■sL+S 



SadSl.lAtL + ^bcKlAaU, 



(4.74) 



The Q condition: for the Q condition it is necessary to define a particle-hole operator using 
a slightly modified annihilation operator to construct a good spherical tensor operator: 



At 



ah 



a\ ® «fe 



LS 



with a defined as: 



With this operator the spin and angular momentum coupled Q condition becomes: 



Q^)ab;cd — /_^ 



1 



Wi 



[snc\^ 



/ , y^SMsCMcA-'^ oh ^cd \^SMsCM 



(4.75) 
(4.76) 

(4.77) 



MsMc 



After some angular momentum recoupling the Q condition can be expressed as a function of 
the 2DM as: 



a(r).^.;i. = M./.piit-x/MMEE[^1'[^lM! I %\\'^ - '^r<\^TdJ' 



tt' L'S' 



2 2 ^' ) Vc Id L' 



(4.78) 

In Table 4.4 the dimensions and degeneracies of the ^-matrix blocks for first-row atoms in a cc- 
pVDZ, cc-pVTZ and cc-pVQZ basis set are shown. The reduction factor in the computational 
cost is even more spectacular for the Q matrix, i.e. respectively 28595, 144794, and 489488. 



4.2.5 The three-index constraints for atomic systems 

We choose to couple the first two particles to good intermediate angular momentum Lab and 
intermediate spin Sab: with intermediate parity -Kab = (— 1) "^ ''. The operator creating three 
particles coupled to good total spin and angular momentum in this way is: 



5^5^:^ 5..)c =^= E E {\'ya\cjb\SabM!,){SabM!,\a,\SMs) 



(ab) ^^^-^'2 2 

ab •- 



(4.79) 



Y^ Y^ {lamalbmb\LabM^f,){LabM^i,lcmc\LML)al^^^^al^^^^al^^^^ . 



mamt M\mc 

ab ^ 



Only those triplets abc that can couple to total angular momentum L and that have the right 
parity vr are allowed in a block (L'^S), i.e. the triplet lahlc has to satisfy the relations: 



\Lab — lc\ ^ L < Lab + ^c , l^a — ^fcl < Lab < la + h j 



and 



^_iya+ib+ic ^ ^ _ (-4_8o) 
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Table 4.4: Dimensions and degeneracies of the t/-matrix blocks for first-row atoms (from Li to Ne) 
in a cc-pVDZ, cc-pVTZ and cc-pVQZ basis set. 





dimension 


deg 




dimension 


deg 


{L-S) 


DZ 


TZ 


QZ 




{L-S) 


DZ 


TZ 


QZ 




(0+0) 


14 


30 


55 


1 


(0+1) 


14 


30 


55 


3 


(1+0) 


5 


14 


30 


3 


(1+1) 


5 


14 


30 


9 


(1-0) 


16 


40 


80 


3 


(1-1) 


16 


40 


80 


9 


(2+0) 


11 


36 


82 


5 


(2+1) 


11 


36 


82 


15 


(2-0) 


4 


16 


40 


5 


(2-1) 


4 


16 


40 


15 


(3+0) 


1 


11 


36 


7 


(3+1) 


1 


11 


36 


21 


(3-0) 


4 


24 


68 


7 


(3-1) 


4 


24 


68 


21 


(4+0) 


1 


11 


46 


9 


(4+1) 


1 


11 


46 


27 


(4-0) 





4 


24 


9 


(4-1) 





4 


24 


27 


(5+0) 





1 


11 


11 


(5+1) 





1 


11 


33 


(5-0) 





4 


24 


11 


(5-1) 





4 


24 


33 


(6+0) 





1 


11 


13 


(6+1) 





1 


11 


39 


(6-0) 








4 


13 


(6-1) 








4 


39 


(7+0) 








1 


15 


(7+1) 








1 


45 


(7-0) 








4 


15 


(7-1) 








4 


45 


(8-0) 








1 


17 


(8-1) 








1 
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In each L'^S block of three-particle space there are many ways to form a complete basis set. 
The choice of a complete basis is more difficult than for three-particle spin coupling because 
when / > 1 it is possible to put more than three particles in the same shell nl. Not all 
intermediate angular momenta Lab can be reached however, because of the antisymmetry of 
the total three-particle state. The symmetry of the state under the exchange of the first two 
spatial indices is determined by the intermediate angular momentum, spin and parity: 



\abc; {LlfSab)LMLSMs) 



Vra6(-1) 



^ab~^^ab 



\bac; {LlfSab)LMLSMs) 



(4.81) 



The part of three-particle space which is antisymmetrical under exchange of the first two 
indices (i.e. TTab{—'^)^'^''~^^'^'' = —1) is completely spanned by the the states a < b < c. If we 
want to extract information about states with a different ordering, e.g. c < b < a, we have 



to recouple, using two times formula (B.ll): 



abc;{L2''Sab)LMLSMs) 



(ab) 



[Sab][L 



ab 



I,- 



E 

Lcb 

Lab 



[Scb][L 



cb\ 



Scb 
^ab 



cba-{L2'Scb)LMLSMs) 



(4.82) 



The part of three-particle space which is symmetrical under the exchange of the first two 
indices is spanned in part by states of the type a < b < c, but other types of states can occur. 
A second type is where the first two spatial indices are equal, a = b ^ c. The two equal 
indices can couple to all intermediate angular momenta allowed by the triangle rule, provided 



(-ly 



-•Sab 



1 . 



(4.83) 



4.2. SYMMETRY IN ATOMIC SYSTEMS 



79 



A new type of state, which didn't occur when only spin was taken into account, consists of 
the three particles in the same shell, a = b = c. There are many states of the form: 



\aaa; {L+^Saa)LMLSMs) , 



(4.84) 



which have finite norm but are not linearly independent. A suitable set has to be chosen 
which spans the full space. This can be done by constructing the overlap matrix: 



SL'^,S',^;LaaSaa ={aaa; {L'^^S'aa)LMLSMs\aaa; {L'^^Saa)LMLSMs) 

=^S'aaSaa^L'aaLaa+'^[Laa][L'aa\[Saa][S'aa]\'i q qT 

L 2 aa 



L L„, 



, (4.85) 



and taking the eigenvectors with nonzero eigenvalues as basis states. 

To simplify the equations, we introduce the notation Xab for the collection of intermediate 
quantum numbers Li^f' Sab- The phase under the exchange of the first two indices is written 



as: 



and by [Xab] the product [Sab] [Lab] is implied. 



(4.86) 



The 7i condition: the spin and angular momentum coupled 7i condition is defined through 
a spin and angular momentum averaged ensemble: 



'UJ^ )abc-4ez 



with 



i ^^J ^'^J MsMc 






+ (^ 



nh 



\B 






CMcSMs,i\^de{Xae)z " ab{Xai)c\^ CMcSMs 



,)) , (4.87) 



B 



ab{Xab)c 



VW) 



ai ® aX 



J-'a.h^a. 



LS 



(4.88) 



This can be expressed as a function of the 2DM using anticommutation relations and angular 
momentum recoupling in much the same way as the spin-coupled 7i condition, leading to the 
expression: 



q- (■p\L'" S{Xab\Xj_^) 

'^\^ labc-Aez 



2Tr r 

N{N -I) 



SczSx^bX,. (5ad5,e + (-l)^-<5ae4.) + S.Jx^.X.T^^, 



y/{ah){dt 



de 



Sx„ 



,Xj 



^{ab){de) 



+5adKlAbn}j + i^adSbe + (-if^'Sbd^ae) KlJtL 



[^a\}\[^de\ \S ^ Sab 



^JJaE)(de) i2 2 ^'^<^ 



L Ic J^ab I 
lb la Lde J 



5az5cdSl,lAi>n,^ + [-l)^''^5azSecKlA''^ 



i-bi-drnbn^ 



+ (-1)^- {/^ t III} {^'^^^'^''^^^'^P'-^-^ + i-^)''''^bzSec6uA:L + hzSaeSuA: 



) 
■n-d 
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+ {-l)^^^^Sb^5adSljJ::l + {SaeScd + {-ir'^S^Jad) Sl^l^pl^ 



ik) 



+ [Xab][Xde] 



b 2 ^ab 
lie- 
2 2 '-'d.e 



+Sa 



hz 



\^ab + ^d 



'{ac) \L Ic Lab I r^Sa, 



(ab) \la h Lde 



ac:de 



(bz) /L Ic Lab\ j.Xa, 



(ez) \L l^ Lde I T^Xab 



ab)\lb la LdeS^^''^^^^^"'^ {de)\le h Lab 



^'"'^ ^ yide)\ld le Labj''^'^'^^ 



+ 



[Xab][X, 



del 



y/{ab){de) j^' W \ Sab 



Q 1 nf 

2 2 '-'de 



SadV{bc){ez) 



+6bd{-lf''Wiac)iez) 



L la L')(L Id L' 

lb 'c Lab) ['e Iz Lde 

L lb L' 

'a 'c ^ab 



+6ae{-l)''^^^/{bc)W)\f ]" f 

l-b '■c ^ab 



bc;ez 

L Id L' 

^e l-z ^de 

L le L' 

Id Iz Lde 



pX' 

^ ac:ez 



yX' 

bc\dz 



+<5be(-l)^"''+^'^V(ac)(d2) 



L lb L' 

'a 'c ^ab 



L le L' \ X' 
1 1 T ( ac\dz 
I'd I'Z J^de 



ab:ze 



(4.89) 



In Table 4.5 the dimensions and degeneracies of the 7i matrix blocks are displayed. Without 
including symmetries the use of three-index conditions would be computationally infeasible. 
For the small cc-pVDZ basis the size of the uncoupled matrix is 3276, for the TZ already 34220 
and for the QZ basis the dimension becomes 215820. The sizes of the blocks in the matrix 
with all symmetries exploited, however, remain manageable even for the largest basisset. The 
reduction factor in the computational cost of an elementary 7i matrix operation is 166466 
for the DZ, 933490 for the TZ, and 3092934 for the QZ basis. 



The 72 condition: the spin and angular momentum coupled form for the 72 map is defined 
as: 



r2(r) 



L^S{Xab;Xa,) 

abc\dez 



E 



1 



w, 



MsMc 



L^S 



-,N 



r£12r512 Z^ y-'SMsCMci^^ ab{Xab)c ^deiXde)z\^SMsCMc,i/ 



iL'^S 



+ \^ CMcSMs,i\^de(Xde)z ^ ab{Xab)c\^ CMcSMs/i-l 



(4.90) 



The coupled two-particle-one-hole creation operator is given by: 



At 



ab{Xab)c 



VW) 



4^4 



LabSab 



LS 



(4.91) 



where the spherical tensor operator cic (see Eq. (4.76|)) appears. As there is no symmetry 



involving the third index, the ordering of indices is easier than for 7i, i.e. the ordering is 
a <b,c when (—I)"''-'''' = 1 and a <b,c when (— l)'''""' = —1. Through anticommutation and 
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Table 4.5: Dimensions and degeneracies of the Ti-matrix blocks for first-row atoms (from Li to Nc) 
in a cc-pVDZ, cc-pVTZ and cc-pVQZ basis set. 





dimension 


deg 




dimension 


deg 


(L-S) 


DZ 


TZ 


QZ 




(L-5) 


DZ 


TZ 


QZ 




(o+|) 

(oi) 


27 


110 


338 


2 


(0+|) 


5 


32 


118 


4 


4 


27 


110 


2 


(o-f) 


6 


26 


84 


4 


(1^) 


24 


130 


477 


6 


(1+i) 


17 


80 


274 


12 


(1-^) 


44 


206 


692 


6 


(i-f) 


16 


85 


306 


12 


(2+^) 


38 


226 


857 


10 


(2+i) 


11 


90 


374 


20 


(2-^) 


24 


164 


668 


10 


(2-i) 


12 


85 


346 


20 


(3+^) 


12 


136 


672 


14 


(3+i) 


8 


76 


358 


28 


(3-^) 


20 


182 


828 


14 


(3-i) 


8 


79 


382 


28 


(4^) 


8 


116 


671 


18 


(4+i) 


1 


46 


298 


36 


(4-^) 


4 


85 


550 


18 


(4-i) 


2 


44 


282 


36 


(5+^) 


1 


42 


376 


22 


(5+i) 





22 


195 


44 


(6+1) 


2 


60 


462 


22 


(5j) 





22 


208 


44 





24 


275 


26 


(6i) 





8 


118 


52 


(6-^) 





15 


218 


26 


(6-i) 





8 


112 


52 


(7+^) 





4 


109 


30 


(7+i) 





2 


57 


60 


(7-^) 





8 


144 


30 


(7-i) 





1 


60 


60 


(8+^) 





2 


65 


34 


(8+i) 








22 


68 


(8-^) 





1 


46 


34 


(8-i) 








22 


68 


(9+^) 








15 


38 


(9+i) 








8 


76 


(9-^) 








24 


38 


(9-i) 








8 


76 


(10+i) 








8 


42 


(10+i) 








1 


84 


(10-i) 








4 


42 


(lo-f) 








2 


84 


(iii) 








1 


46 


(11+|) 











92 








2 


46 


(Hi) 











92 
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angular momentum recoupling one obtains an expression of Ti as a function of the 2DM: 



'2V-L )abc;dez 



^X^,X^ (5,,<5,e + (-l)^-5„e56d) SijJ^i^ + 5,,6x^,x,r^C 



V{ab){de) 



^J{ah){de 
S 



Et^'] 



X' 




Xab 

de 



Ic L' le 
L Iz Lde 

c - ^. >r 



' ce:zb 



X' 



+(-1)^'"'<5mV^MM < Ic L' le 

yLab la lb ^ 

{J-j iz J-^de 
I, L' I 
J-iab lb la ^ 

{J^ Iz L'de 
Ic L' h 
^ab la lb , 



^ i cd;zb 



' cd:za 



(4.92) 



In Table 4.6 the relevant matrix dimensions for 72 are shown. For comparison, the uncoupled 
72 matrix has a dimension in the three basissets, DZ, TZ and QZ, of respectively 10548, 
106200 and 659450. The reduction factor in the computational cost of an elementary 72 
matrix operation is 201137 for the DZ, 1015334 for the TZ, and 3235624 for the QZ basis. 



The 72 condition The spin and angular momentum coupled 72 map is identical to the 
regular 72 map when the total spin S* is | or the total angular momentum L is larger than the 
maximal single-particle angular momentum in the basis set /max- When S = ^ and L < /max 
the 72 matrix has the form: 



2v ) abc,nm;dez ,nn 



'^\^)abc;dez 

■\{Xde) 



l^ 2^Xab;Xi^) iXah) 



CO 



OJ' 



(nml)\dez 

in which cj can be written as a function of the 2DM: 



abc;{nnl) 
(0 

Primrin 



(4.93) 



if;in=^(-i)^""-yMr-^ 



abc;{nnl) 



ab 

ab;(n„l)c 



(4.94) 



4.3 Symmetry in the one-dimensional Hubbard model 

The one-dimensional Hubbard model [65j is the simplest model possessing non-trivial corre- 
lations present in a solid, described by the Hamiltonian: 



H = -tY^ Ua,+i,„ + al_^_-^.^ai,„j +^Y1 ^It'^^ta^o^ • 



(4.95) 



It pertains to a one- dimensional lattice, with sites labeled hy i = 1, . . . ,L. Periodic boundary 
conditions (PBC) are assumed throughout the thesis. 

The complexity of this seemingly simple schematic Hamiltonian lies in the competition 
between the first term, called the hopping term, which delocalizes the electrons, and the 
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Table 4.6: Dimensions and degeneracies of the 72-niatrix blocks for first-row atoms (from Li to Nc) 
in a cc-pVDZ, cc-pVTZ and cc-pVQZ basis set. 





dimension 


deg 




dimension 


deg 


(L-S) 


DZ 


TZ 


QZ 




(L-S) 


DZ 


TZ 


QZ 




(0+^) 


85 


336 


1023 


2 


(0+i) 


32 


142 


456 


4 


(0-^) 


14 


85 


336 


2 


(o^i) 


10 


53 


194 


4 


(1+^) 


72 


390 


1431 


6 


(1+i) 


41 


210 


751 


12 


(1-^) 


132 


618 


2076 


6 


(i^i) 


60 


291 


998 


12 


(2+^) 


113 


676 


2567 


10 


(2+i) 


49 


316 


1231 


20 


(3+1) 


70 


488 


1998 


10 


(2-f) 


36 


249 


1014 


20 


37 


410 


2020 


14 


(3+1) 


20 


212 


1030 


28 


(3-^) 


62 


550 


2490 


14 


(3-i) 


28 


261 


1210 


28 


(4+i) 


24 


348 


2013 


18 


(4+1) 


9 


162 


969 


36 


(4-^) 


12 


255 


1650 


18 


(4-i) 


6 


129 


832 


36 


(5+^) 


2 


124 


1124 


22 


(5+i) 


1 


64 


571 


44 


(5-^) 


6 


179 


1384 


22 


(5~i) 


2 


82 


670 


44 


(6+^) 


1 


74 


829 


26 


(6+i) 





32 


393 


52 


(6-^) 





46 


656 


26 


(6-i) 





23 


330 


52 


(7+^) 





12 


327 


30 


(7+i) 





6 


166 


60 


(7-^) 





24 


432 


30 


(7^1) 





9 


204 


60 


(8+|) 
(8i) 





6 


194 


34 


(8+|) 





2 


87 


68 





2 


136 


34 


(s-f) 





1 


68 


68 


(9+^) 








46 


38 


(9+i) 








23 


76 


(9-^) 





1 


74 


38 


(9-i) 








32 


76 


(10+^) 








24 


42 


(10+i) 








9 


84 


(10-i) 








12 


42 


(lo-f) 








6 


84 


(11^) 








2 


46 


(11+i) 








1 


92 


(11-^) 








6 


46 


(ii~i) 








2 


92 


(12+^) 








1 


50 


(12+i) 











100 



84 
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Figure 4.2: Illustration of the two terms present in the Hubbard Hamiltonian, electrons can jump 
to nearest-neighbour sites with amplitude tij. When two electrons are on the same site, there is an 
energy penalty of U. 
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E 



k = 7T 



k^ 



3 



k=^ 



27r 
3 



4^fc = f 



4^fc 



2L 
3 



4^/c = o 



2t 

t 


-t 

-2t 



Figure 4.3: A representation of the structure of the ground state when U 
lattice of length L = 6. 



for a half-filled 



second on-site repulsion term which is diagonal in the site basis. In Figure 4.2 a graphic 
representation of the two terms in the Hubbard Hamiltonian is shown. As there is no preferred 



direction in spin space in Eq. (4.95), spin symmetry can be exploited and all the results from 



Section 4.1 are taken over. However, far more symmetries are present in the model, which 
allow for a huge reduction of the matrix dimensions involved. 



4.3.1 Translational invariance 



When periodic boundary conditions are assumed (as in Figure 4.2) the Hamiltonian is in 



variant under translations along the lattice. This is an abelian symmetry for which it is 
easy to block diagonalize the 2DM and its matrix maps, as the correct basis transformation 
in single-particle space automatically block diagonalizes all matrices on higher-order particle 
space. Translational invariance is exploited by Fourier transforming the site basis to quasi- 
momentum space: 



\ka) 



lE 



e^^'lja) 



where L is the lattice size, and k takes on the values -^ for n = 
hopping part of the Hamiltonian becomes diagonal in this basis: 



(4.96) 



,L — 1. The kinetic or 



Hh, 



op 



-2^E 



cos k a\^ak 



(4.97) 



k(j 
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from which it follows that in the non-interacting {U = 0) ground state, the electrons occupy 



the states with momenta lower than the Fermi level, as illustrated in Figure 4.3 for a half-filled 
lattice of length L = 6. 



4.3.1.1 The 2DM for translationally invariant systems 

The eigenstates of the Hubbard Hamiltonian have a good total quasi-momentum /C. The 
2DM for these states, expressed in the quasi-momentum single-particle basis: 






Y.^^^jLY^^'^sk'^B^lk, <fcj^, 



[S] 



NK \ 
SM,il ' 



M 



where 






\/iKh 



ai (g) at 



(4.98) 



(4.99) 



is automatically block diagonal, because the only non-zero matrix elements in Eq. (4.98) are 



those which conserve momentum: (ka + /cf,)%27r = (kc + kii)Vo2T:. This means we have L 
blocks F'^-'^ for every S, with two-particle states that satisfy K = {ka + kh)%2TT, and a block 
dimension that scales linearly with lattice size L. 



4.3.1.2 T^vo-index constraints for translationally invariant systems 

The spin-symmetric matrix constraints simplify considerably by including translational in- 
variance, because the IDM is automatically diagonal in the quasi-momentum basis: 



Pk 



E^^TcJ2E^^5.M,>L«fc-l^ 



[S] 



NK. \ 
SM,i) 



M 



The translationally invariant IDM can be derived from the 2DM as: 

12 



Pk 



N 



3Y 2_^ ~2~ Z^(^^ Wkk';kk' 



(4.100) 



(4.101) 



The Q map: the translationally invariant Q map is defined as: 

1 



Qi^)k5ct;k,ka =^^^r^^i^SM,i\BkX B^ 



NK \T3SK o'tSK .jNK \ 
kckJ^SMA/ ' 



M 



(4.102) 



with B' given by Eq. (4.99). The expression of the Q map as a function of the 2DM becomes 
quite simple: 



^SK 



^ ' kak\,\kckfi kakfj'^kckd 



i^kakjkbkd + {-'^)^^kaka^ki,kc 



2Tr F 

N{N-l) 



Pka - Pkt 



y^ikakb){kckd) 

(4.103) 
The Q map is also block diagonal in K = (ka + kb)%2Tr, and has the same dimensional 
reduction as the 2DM. 
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The Q condition: the Q condition is slightly more complicated, because the correct anni- 
hilation or hole operator is given by: 



flfc^ = (-l)2+'^afc_^ , with k = -k%27r . 

Using this operator the translationally invariant Q map becomes: 



[Sf 



(4.104) 



(4.105) 



M 



where K = (ka + k}j)%2'K = {k^ + kd)%2-K and with the particle-hole operator defined by: 



^t 



SK 

Kar^b 



"L ^ ^fcfc 



SK 



The G map can be expressed as a function of the 2DM: 



^(r)fil,;fc,fcd = 5k^kahak,Pka " \J {kakd){khkc) ^^ 



2 2 ^\ T^'K' 
2 2 '-^ ) 



(4.106) 



(4.107) 



from which one can see that the blocks in the Q matrix with K = {ka+kf,)%2TT = {kc+kd)%2Tr, 
correspond to the blocks with K' = {ka + kd)%2Tr = (kc + ki,)%27r in the 2DM, as can expected 
for a particle-hole transformed quantity. 



4.3.1.3 Three-index constraints for translationally invariant systems 

Three-particle space is built up of the same type of states as in the spin-coupled case (See 
Table E?T|, but now divided into L separate Sections with the same three-particle momentum 
K = {ka + kh + kc)yo27r. From now on Latin indices a are used to represent momenta ka, in 
order to ease notation. 



The 71 map: in translationally invariant form, the single-particle part of the 7i map sim- 
plifies considerably: 



ri(r) 



SK(Sab;Sde) _ ^CZ^SabSi, 



abcidez 



yj{ab){de] 



{SadSbe + i-lf'^'SaeSbd) 



2Tr r 



\N{N - I] 



Pa- Pb- Pc 



[Sab][Sde\ is K Sab 



, , ,,,,, , X ! T\ i^-^^cdSbe + {-ir^^SaJceSbd + (-l)^"''(^fe.^a 

y'[ab)[de) 1 2 2 '-'de) 

+ i-lf''''^'''SbJce5ad) [Pa + Pb + Pc] + SczSs^.S.A^e'" 



S r, Sab 



2 
1 

2 2 



+ [Sab][Sde]ri I r} ^«W|^rf-^ + <^,.(-i)^-+^-J|^rf-^^^^ 






[Sab] [Sde\ V^rc'l 



yj{ab){de 



S' 



S'] (S 



1 1 

2 2 



s. 



ab 



1 1 

2 2 



5' 
Sde 



^S'K,. 



5a.\/(Mrez)r-:- 



+4d(-l)^"''7K)Mrf:^r + Saei-lf''^V(bc}W)rZ: 
+56e(-l)^"''+^^^ V(ac)(dz)rf^^r 



S'Kt, 
dz 



(4.108) 
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In the above equation the intermediate momenta appearing in the 2DM part of the equation 
are defined as: 

Kab = {a + b)%27r . (4.109) 

It can be seen that if three-particle momentum conservation is fulfilled, i.e. (a + 6 + c)%27r = 
{d + e + z)%2tt, then the terms containing the 2DM conserve two-particle momentum. The 
three-particle matrix reduces to a block-diagonal matrix with L quasi-momentum blocks for 
every S, each of which scales quadratically with the lattice size L. 

The Ti map For the Ti map we need to define a translationally invariant two-particle-one- 
hole operator: 



i?t 



SK 

ab{Sab)c 



1 



7P) 



4^4 



SabKah 



SK 



(4.110) 



where a is used, as defined in Eq. (4.104), and K = (a + 6+c)%27r. The definition of 72 is then 



exactly the same as in Eq. (4.51), when performing the usual angular momentum recoupling 

and can be rewritten as a function of the 2DA' 

Scz {6ad5be + {-if^'Saehd) Pc + SczSs^,SaJab-a 



and anticommutation relations, and can be rewritten as a function of the 2DM: 



'21^ )abc:dez 



yj{ah){di 

[Sab] [Sde 




SabKab 

de 



^J{ab){de) 

+(-i)^'"'4dvWWrcS 



vWMrfe; 



S'Kae 

ce:zb 



+ 



-i)^-<5.evMMrf4 



S'K^d 
b 



+{-lf-''+^''^6beV{cd){za)r 



cd:za 



(4.111) 



Again one sees that if the two-particle-one-hole momentum is conserved, i.e. {a + b + c)%2Tr = 
(d -\- e + z)%27r, then the two-particle momenta appearing in the rows and columns of the 
2DM are the same, e.g. if a = d then (c + e)%27r = (z + 6)%27r. The Ti map also becomes 
block diagonal, with each block scaling quadratically with lattice size. 

The 72 map The 7^' condition can be similarly treated. For 5 = | it again reduces to the 
regular 72 map. For S = i^ the dimension of each K block increases by one: 



'-J~' fpX 2 ^(■-'abi'-'de) 

'2 1-'- )abc:dez 



q- f-p\2^\^'^l''^<ie) (Sab) 

' "^y^ ) abc;dez ^abc;k 

iiSde) 

\ ^ k-.dez Pk / 



with 



ix) 



(Sab) 

abc,k 



M(-i)^+^-vWraf"^ 



(4.112) 



(4.113) 



4.3.2 Parity 

The Hubbard model with periodic boundary conditions (PBC) has more symmetries than 



spin and translational invariance, one of them being parity. As we saw in Section 4.2, which 



deals with atomic systems, parity follows from the symmetry under the inversion of space, 
i.e. r — )■ — r. One can readily appreciate from Figure 4.2 that a similar discrete symmetry is 
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present here, i.e. the Hamiltonian is invariant for the inversion of the site index i — )• —i%L. 
From the Fourier transform in Eq. (4.96) it can be seen that the effect of this operator on 
a momentum state is to transform k into k = —k%2TT. However, this operation does not 
commute with translation, which means that the eigenstates of the Hamiltonian cannot have 
good momentum and parity at the same time. From now on we assume that the number of 
lattice sites L is even. 

As a consequence, if the ground state has momentum |/C) 7^ or vr then it is doubly 
degenerate, forming a doublet with \fC). In what follows we will use this degeneracy to 
exploit both translational invariance and parity to reduce the dimensions of the matrices 
involved in the program. The following considerations are valid for every /C. We start with 
the simplest case, the IDM. 



4.3.2.1 Translationally invariant IDM vi^ith parity 

Single-particle space is built out of L different momentum states having up or down spin, 
\ka), for < A; < 27r and o" = ±2- If we transform to a basis with good parity, momentum is 
no longer a good quantum number: 



Ik^'a) = PNk {\ka) + 7r\ka)) , with < ^ < vr . 



(4.114) 



Two states, A; = and k = 7r[jare mapped on themselves, and only positive parity states can 



For the other states, < k < ir, 

To 



be formed with these momenta. They have norms ^Nk = 

both positive and negative parity combinations can be constructed, with norm pN^ = -y= 
take advantage of both symmetries at the same time, we define the IDM using an ensemble 
of the |/C) and \iC) states: 



P'k^k'^' 



T. 



Wi 



1 1 

2jS\- 



M pe± 



^^P^^\a^ a- , \q>^^P^^) 



SM, 



PNl 



Y^ [^kk' {pf + T^^'pf) + Kk' {^^'pf + T'pf 



pe± 



(4.115) 
(4.116) 



P'k 



pI 



from which it follows that Eq. (4.116) is diagonal in vr. If we now define: 



in which p| is a regular translationally invariant IDM as defined in Eq. (4.100). Because of 
parity symmetry we have, 

(4.117) 



1 



Pk 



pI'tpI 



(4.118) 



the translationally invariant IDM with good parity reduces to: 



Pk- 



1 



{pk + Pk) 



(4.119) 



which can be seen to be independent of parity. The IDM is still diagonal in /c, but less 
elements have to be stored, since for < fc < vr there is a degeneracy in parity. 

^Note that we use tt for both the parity quantum number, vr = ±1 as for the transcendent number; in what 
foUows it is always clear from the context what interpretation should be given to tt. 
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4.3.2.2 Translationally invariant 2DM with parity 

In contrast to parity in atomic systems, the parity of a two-particle state for translationally 
invariant systems is not the product of the parities of the single-particle states building up 
the two-particle state. Instead, the parity is inherently a two-particle property, and the 
single-particle states building up the two-particle states have no good parity: 

\ab;SK'') = ^N^i,{\ab;SK)+7r\ab;SK)) , (4.120) 

with 

< i^ < vr and < a, 6 < 27r . (4.121) 

In general the 2DM is defined using a pIC ensemble: 

J-afe;cd — 2^ ^« 2 [512 2^2^ ^-SX,* I afe ^cd \^SM,i)^ {^.iZZ) 

i '- ' M pe± 

where 

B^T = 'N^, (b^T + vriJtf ) , (4.123) 



and with B' defined as in Eq. (4.99). Because of this p/C-ensemble definition and the fact 
that parity symmetry implies that, 

one sees that the 2DM becomes diagonal in two-particle parity. As was the case for the IDM, 
the K = and vr are mapped on themselves, but because the single-particle momenta a, b 
change, both positive and negative parity combinations can now be formed. Let us take a 
look at the different possibilities: 



K = 0: for K = the single-particle indices in Eq. (4.120) have to satisfy: 

{a + b)%2iT = or a = b. (4.125) 

This means that K = states can be written as: 

\aa;SO'') = ^N^^{\aa;SO) + TT\aa;SO)) , (4.126) 

in which the second term is equal to the first but with exchanged single-particle indices. From 
previous discussions we know that the symmetry of the two-particle state under the exchange 
of the single-particle indices depends on the two-particle spin, i.e. 

\aa; SO) = {-lf\aa; SO) . (4.127) 



One can see from Eq. (4.126) that for S = only the positive parity states, and for S = I 
only negative parity states remain. The norm is given A^^^ ~ 2' ^^ ^^^ K = case, the 
definition of the parity-symmetric 2DM as a function of the regular translationally invariant 
2DM then reduces to: 

^ab;cd = ^n(-l)s'rab;cd ■ (4.128) 
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< K < tt: for < K < tt the first and second term in Eq. (4.120) consist of different 



single-particle indices a j^ b, implying that both positive and negative parity combinations 
can be constructed, with norm ^N^ = ^. As shown for the IDM, the p/C ensemble makes 
the 2DM diagonal in, and independent of, parity. Hence every block is twofold degenerate. 
Since K ^ K there are only two terms remaining in the definition of the parity-symmetric 
2DM: 

r£ = ^te + rg,) . (4.129) 



K = tt: Finally, for this block K again equals K. In this case there is always one state 
that is mapped on itself, and for which only a positive parity combination can be formed, 
i.e. lOvr; S'vr"''), with norm '^ Nq^ = i^. For all the other states in this block both positive 
and negative parity combinations can be formed, with norm N'^^^ = -j=. Because of the pK, 

ensemble, the 2DM falls apart in a positive and negative parity block, and since K = K, four 
terms remain in the definition of the 2DM: 



r^TT _ i ATTT i ATTT 

-L ab:cd — ^^ ab ^^ cd 



■^Sn 



■^Sn 



ab;cd + ^aicd + ^ 



"pS'TT |_ 'pSn 

ab:cd ab:cd 



(4.130) 



We observe from Eq. (4.130) that the original K 

and S 



parity block, for both the S 

between the positive and negative parity block! 



TT block reduces to a positive and negative 
1 part. Also note that there is no degeneracy 



4.3.2.3 Parity symmetric form of the two-index constraints 

Before deriving the parity-symmetric form of the two-index constraints, the derivation of the 
parity-symmetric form of the IDM out of the 2DM is considered. Expressed as a function of 
the regular translationally invariant 2DM, the parity-symmetric IDM is given by: 



Pd=7, {Pa + Pa) 



1 



N 



5 b 



-*- ab;ab ' ^ ab\ab 



(4.131) 



From the previous Section it can be deduced that, for all values of K: 



ySK I ySK _ ^ -^ 

^ ab;cd "•" -*- ab;cd o r ATK F AtK 
^ ^^ab ^^cd 



^SkM , ■p5A'(-) 



ab:cd 



+ r: 



ab;cd 



(4.132) 



from which it follows that: 



Pd 



N 



^ v^ [S? sp i""j y^ 

_ 1 Z^ 2 ^4 rATK"^ ^ 

S b ^ ^^ab TT 



"^) Y^ p5i^ 



ab\ab 



(4.133) 



Bear in mind that the single-particle momentum a in the above equations is in the range 
< a < TT, but the momentum over which is summed, b, runs over the entire range < 6 < 27r. 
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The Q condition: the Q matrix is analogous to the 2DM T, and the expression of the 
parity-symmetric Q as a function of the regular translationally invariant Q matrix is exactly 



the same as for the 2DM in Eqs. (4.128), (4.129) and (4.130). The expression of the parity- 



symmetric Q map as a function of the parity-symmetric 2DM is then easy to derive using 



Eq. (4.103), leading to the following expression: 



Q(r)f£ 



-'- ab;cd + 



\7pypy 



2Tr r 

N{N-1) 



Pa-P'b 



(4.134) 



This expression is valid for all values of K. The only difficulty is that the indices on the 
left-hand side can have the values < a < 27r, while for the IDM only terms < vr are stored. 
This is solved by using the tilde above the index, by which is implied that: 



if a < vr 
if a > TT 



(4.135) 



The Q condition: the parity-symmetric form of a particle-hole state is defined as: 



\ah-SK^) 



5atK 



'N. 



ab 



a)a ® flfe 



SK 



+ 



aA (X" ai 



SK\ 



)|0) 



(4.136) 



in which the hole operator aka is defined as in Eq. (4.104). Using this parity-symmetric 



particle- hole operator the Q map is defined in a pK, ensemble, which once again renders the 
matrix diagonal in particle-hole parity. The particle-hole states can be divided into two 
classes, on the one hand -ftT = or vr, and on the other hand those states which are mapped 
on a different momentum. For simplicity we first consider this last class. 



< K < tt: in this case one can construct both positive and negative parity combinations, 
with norm N^ = —?= . The resulting Q matrix contains only two terms, because of momentum 
conservion, and is independent of particle-hole parity, so every block is twofold degenerate: 



^(r)f£=2[^(r)f.^. + ^(r)& 



(4.137) 



This implies the following expression of the Q map as 
2DM: 



function of the parity-symmetric 



^(r) 



ab:cd 



^/(ad)p)_^ , 

-OacObdPa ^ ^^j^, p^^, 2_^[a J 



ad 



cb S' 



1 1 



S 
S' 



EYS'Ki""' 
ad;cb 



(4.138) 



K = and K = tt: both the K = and K = n blocks are mapped on themselves. For 
K = the action of the parity operator is again to exchange the single-particle momenta, 
but in contrast with the two-particle case, there is no symmetry between the particle and the 
hole index. As a consequence positive and negative parity combinations for both K = and 
K = TT can be constructed, with norms N^ = -y=. There are a few exceptions however: for 

K = 0, the states with a = b = and a = b = tt, and for K = tt the states with a = 0, 6 = vr 
and a = 7T,b = 0, are mapped on themselves and only occur in the positive parity block. 
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with norm ^ N^ 

function of the regular translationahy invariant Q is: 



i. The general form of the parity-symmetric Q map when K 



XV , clS H- 



^(r) 



ab:cd 



'n!^, 'n^. 



\SK 



Qirfa^M + Qim-ci + - [QimM + ^(r) 



\SK 



)SK 
ab;cd 



(4.139) 



In this case the expression of ^ as a function of the 2DM is a bit more complicated: 



G{r)!g: =Sac5MP. - 'N^b 'N^ Y.iS' 
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Viad){cb) ^ j.s'K"-" 

4 r^Y/C" rjSlK" Z^ '- ad;cb 



ad 



'cb 



(4.140) 



4.3.2.4 Parity symmetric form of the three-index constraints 

The construction of a complete parity-symmetric three-particle basis involves a lot of book- 
keeping. The main problem is that the parity transformation acting on the states as defined 



in Table 4.1 does not always transform them into states that are either identical or orthogonal 
to the original state, as was the case in the two-particle and particle-hole space. We define a 
parity-symmetric three-particle state as: 



\abc- {Sab)Sk^) 



T.^SK 



ab{Sab)c 



\abc; {Sab)SK) + 7r|a6c; {Sab)SK)) 



(4.141) 



A description of the reduction of the three-particle basis in a parity-symmetric form is given 
by considering three cases separately. 



K = 0: the spin-symmetric basis was built up of states as shown in Table 4.1 The same 
ordering holds in every K block when translational invariance is included, but only states for 
which (o + 6 + c)%2tt = K are allowed in the block. 

If S = 2i both Sab = and Sab = 1 are possible intermediary spins. For Sab = there are 
two possible configurations: a = b ^ c and a < b < c. For Sab = 1 only the latter configuration 
remains. For the first type of configuration, i.e. |aa6;(0)^0), the parity operation leads to 
a one-to-one mapping of states with a < vr on states with a > tt. These states are linearly 
independent, which means both positive and negative parity states can be formed, with norm 

the state IttttO; (0)iO), is mapped on itself, and 



^^aa{0)b 



—7=. The state with a 
v2 



TT, I.e. 



Ao 



1 

2- 



only a positive parity state can be formed, with norm ^^-^^ i-q^q 

For a < b < c, both positive and negative parity states can be formed when a > 0. 
However, when a = 0, the state is mapped by a parity transformation on a non-orthogonal 
state: 



\Oab;iSab)lo) 



[Sab]{-l)'-'Y.^SacK-l)'-!^l I f^'^\Oab;iSac)lo). (4.142) 



Constructing good parity combinations out of these states, as in Eq. (4.141), mixes up the 



intermediary spin quantum number Sab, which can no longer be used to label our basis. This 
is best avoided, and it is more convenient to use the configuration: 



\abO; iSab)SO^) 



abO; iSab)SO) 



7r\baO;iSab)lo) 



(4.143) 
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when one of the momenta is zero. These states have good parity while simultaneously keeping 
Sab as a good quantum number. When Sab = only the positive parity combination is 
possible, when Sab = 1 only the negative combination remains. 

For 5 = 1, the states are completely antisymmetrical in the momentum indices abc, and 
the construction of a parity-symmetric basis becomes much simpler. The configurations a < 
b < c span a complete basis. When a > 0, parity maps these states on linearly independend 
states, implying that both positive and negative parity states can be formed. When a = 0, 
the action of the parity operator is: 

P\Oab- (1)^0) = \Oba; (1)^0) = -|0a6; (1)^0) , (4.144) 

so only negative parity combinations can be formed. 

< K < tt: when the momentum of a three-particle state is between and vr, it is mapped 
by the parity operation on an orthogonal state, and both positive and negative parity states 



can be constucted using Eq. (4.141) 



K = tt: three-particle states with K = n are mapped on states which are not necessarily 
orthogonal, so analogous problems arise as for K = 0. 

For S = ^ and Sab = 0, we have the configuration a = b ^ c. There is one state of this type 
which is mapped on itself and for which only a positive parity combination can be formed, 
i.e. lOOvr; (0)|7r). For the other states of this type both positive and negative combinations 
can be formed. When a < b < c there is no problem if 6 7^ tt. If 6 = tt, however, the parity 
operator transforms the state into a non-orthogonal state: 

|a7r&;(5„fc)^7r) = [5,,](-l)^-^[5„e](-l)^"^|j f ;^^|^| lavrft; (5,,)^7r) , (4.145) 

which means that a parity-symmetric state will again mix up states with different intermediate 
spin. To avoid this we use a different configuration in this block when one of the indices is 
equal to vr: 

|a67r; (5,^,) J^^) = I (\ab7T; (S,fe) Jvr) + 7r|6a7r; (S^t);^^)^ . (4.146) 



2 ' 2 V ■ ^ ""' 2 ' ' ' ^ ""' 2 , 
This state has good parity and intermediate spin and only two states remain, i.e. the positive 
parity state for Sab = and the negative parity state for Sab = 1- 

When (S = I the complete antisymmetry in the momenta again makes the construction 
of a parity-symmetric basis easier. Almost all states are mapped on a orthogonal state, so 
positive and negative parity combinations can be formed. Only when one of the indices is 
equal to vr, the positive parity state vanishes because of the antisymmetry. 

The 7i map: the parity-symmetric translationally invariant 7i condition is defined using 
a pIC ensemble: 



'n>^)abc;dez ~ l^^^^XSV ^ ^ \\^ SM,i \^ ab{Sab)c^de{Sd,)z\'^ SM,i I 

+ \^5A4,i \^de(Si,)z^ ab{S,t,)c\'^ SM,i I j ' (4.147) 
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where 



i?t 



ab{Sab)c 



iSK 



vSK 



^^ab{Sab)c [^ ab{Sab)c + ^-^ abiS,b)c) ' 



(4.148) 



and with B^ defined as in Eq. (4.43). Because of the p/C ensemble the 7i condition is diagonal 



in both parity and momentum K. For < K < tt the parity-symmetric 7i is expressed as a 
function of the regular translationally invariant 71 as: 

mnabc;dez " ^ab{S,b)c ^de{Sae)z [^nnabc;dez + ^H^).^ -__ I , (4.149) 



which is independent of parity, and as such doubly degenerate. The blocks with K = or ir 
are split up in a positive and negative parity block. Since K = K four terms remain in the 
definition of the parity-symmetric 71 : 



1'^ ' abc\dez «uia ,\„ 
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S(Sab',Sde}K 
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(4.150) 



When expressed as a function of the 2DM, the part of the 71 condition that is a function of 
the IDM (which we call T^^) is the same for all K values: 



7-^P(T^\SK{Sab\Sde) 
1 \^ )c 



'abc:dez 



T-iAT^.f. , r^NSK 

ab(Sab)c de{Sde)z 



SczSSabSde 

yj{ab){de 
1 

2 



{Kd^be + {-if^'SaeS, 



bd 



f 2Tr r 

\N{N - 1) 



Pa- Pi- Pc 



[Sab][Sde] is I ^A{5aJcd6be + {-lf'^6a.6,e6bd + {-lf^'6,Jae6cd 



1 

2 2 



y^{ab){de 
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(4.151) 



The remaining part, which is a function of the 2DM, is different when < X < vr or i^ = 0, vr. 
The parity-symmetric express ion of this part of the 71 map can be obtained by replacing all 
2DM terms appearing in Eq. (4.108) by their parity symmetric counterparts. For < K < tt 
the correct replacement is: 



"cz-L ab:de 



.S„.bK\ 



Ti ivfSK Ti ivfSK 

r 'Vb(5a6)c ^''de{Sde)z ST^ T^^abJ^ab 

^ ^^ab ^^de tt' 



(4.152) 



For X = or TT, the replacement becomes: 



X -pSabKab 

"cz-L abide 



Ti l^^SK Ti atSK I "^c^ Z2n' ^ab;de 
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(4.153) 



The 72 map: the construction of a parity-symmetric two-particle-one-hole matrix is easier, 
as there is no symmetry between the particle and hole indices. A two-particle-one-hole state 
with good parity can be constructed through: 



\abc; iSab)SK^ 



Ta atSK 



'-N. 



ab{Sab)c 



\abc; {Sab)SK) + 7r\abc; {Sab)SK)) 



(4.154) 
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When < i^ < vr all states are mapped on orthogonal ones by the parity operator, so 
positive and negative parity states can be formed. For i^ = or vr, some states are mapped 
on themselves, i.e. for K = 0: 



IOtttt; iSab)SO) , |000; iSab)SO) 



and 



IvrvrO; {Sab)SO) , 



and for K = ir: 



\00tt; {Sab)STT) , |0^0; {Sab)Sn) 



and Ivrvrvr; (Sab) Sir) 



For these states only positive parity combinations can be formed, with norm N^^^ \c~ 2' 
There is another type of state for which not all parity combinations can be formed, because 
of the symmetry between the first two indices. For iT = this is: 



\am- {Sab)S^) = l&aO; (Sab)SQ) = (-l)^"''|a60; (5„,)50) 



(4.155) 



and for ET = vr: 



laSvr; (5,^)5^) = \ba^- {Sab)S7:) = {-lf-'^\ah^- iSab)Sn) 



(4.156) 



For these states, if Sab = only positive parity combinations can be formed, and if Sab = 1 
only negative combinations are allowed. The Ti condition is defined using the pfC ensemble: 
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(4.158) 



and with B^ defined as in Eq. (4.110). There are two different expressions of the Ti map. 



depending on the momentum of the block K. As before, when < K < tt the blocks become 
degenerate in parity, and the Ti map is defined as: 



^SiSab^SaeW ^ T2]yK 
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. (4.159) 



The K = Q and K = tt blocks, however, split up in a positive and negative parity part, which 
are not degenerate: 



^S{S,b\Sa,)K^ ^ Tij^K 
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. (4.160) 



Expressed as a function of the 2DM, the parity-symmetric 72 map stays largely unchanged 



from Eq. (|4.111|). The only thing one has to do, is replace the translationally invariant IDM 

Pc^Pc, (4.161) 



by its parity-symmetric counterpart: 
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replace the first 2DM term as: 
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(4.162) 



if < -fC < vr or: 
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(4.163) 



if X = or IT, and replace the four remaining 2DM terms by: 
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if < -fC < vr and by: 
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(4.165) 



if X = or vr. 



The 72 map For the parity-symmetric T2 map there is only a small change compared to 



Eq. (4.112). As derived in the previous paragraph for the Ti constraint, the terms with 



< K < TT become twofold degenerate in parity and only half of them needs to be stored. 
The terms with K = or n fall apart in a positive and negative parity block. The change 
in the parity-symmetric 7^' compared to the regular translationally invariant 7^' is that for 
K = and vr, only the positive parity part is different from the 72 map. The blocks that are 
different than the standard 72 have the form: 



-T-//-p\ 2 {^ab\Jde)k 
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The expression of the parity-symmetric w as a function of the 2DM is: 
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(4.167) 
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for < k < IT and: 



abc;k ^pj, ab(Sab)c I 9 T AT^ab FlVf^ab 

^ ab kc 



S bK'^- 

+'^ j? ^? 5 (4.168) 

a6 fcc 

for fc = TT or 0. 

The listing of all the above expressions is quite tedious but must be done for completeness 
and future work in this area. The extra bookkeeping connected with including additional 
symmetries pays off, however, and leads to considerable speedup factors. 



CHAPTER 5 



Applications 



In this Chapter the A^-representabihty constraints and semidefinite programming algorithms 
developed in the previous Chapters are applied to some physical systems. In the first Section, 



the spin and angular momentum symmetry decomposition discussed in Section 4.2 is used 
to study the isoelectronic series of Beryllium, Neon and Silicon, see also [66]. The next 
Section deals with the dissociation of diatomic molecules, and the failure of the standard N- 
representability conditions to describe the dissociation limit |67]. An analysis is made as to 
why these constraints fail, and new conditions are derived to fix this pathological behaviour 
|29^l68j. In the last Section the one-dimensional Hubbard model is studied, exploiting all the 



symmetries discussed in Section 4.3 The performance of the different constraints discussed 
in Chapter [2] is studied for various fillings of the lattice, and it is found that higher-order 
constraints are needed to correctly describe the strong-correlation limit. 

5.1 The isoelectronic series of Beryllium, Neon and Silicon 

The isoelectronic series of atomic systems consitute a well-known benchmark problem in 
electronic structure theory, and a good test for the performance of any many-body method. 



The atomic Hamiltonian is given by Eq. (4.58). For a fixed number of electrons N, one can 



consider the central charge Z as a variable, and this range of systems is called the A^-electron 
atomic isoelectronic series, though usually it is named after the neutral atom species, e.g. the 
Beryllium series for A^ = 4. If the interaction between the electrons is neglected, we recover 
the hygdrogenic Hamiltonian. An interesting thing about the Hydrogenic Hamiltonian is 
that there is an accidental symmetry, which leads to an additional degeneracy in the single- 
particle spectrum, corresponding to the main quantum number n. When the interelectronic 
interaction is switched on, however, this symmetry is broken and the degeneracy disappears. 
Using the electronic series, one can look at the transition between these two regimes. Starting 
from the neutral atom, Z = N, the central charge is increased, which is equivalent to scaling 
down the electron interactions. In the limit Z — )• oo the hydrogenic Hamiltonian is recovered. 
For Beryllium, the incipient degeneracy of the 2s and the 2p levels causes the ground state 



to be increasingly multireference as Z increases (as is shown in Fig. 5.1), which is hard 
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Figure 5.1: Schematic representation of the influence of the increase of Z for the ground state of 
BeryUium. When Z = N, the ground state is dominated by a single reference (ls)'^(2s)^ singlet. Tl 
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to describe for many methods, including Hartree-Fock and DFT. Neon has ten electrons, 
and the ground state therefore has a closed shell (ls)^(2s)^(2p)^ configuration, which means 
we don't expect any near-degeneracy problems. For Silicon which has 14 electrons in a 
(ls)^(2s)^(2p)^(3s)^(3p)'^ configuration, there is not only a near-degeneracy problem, but the 
additional complication that the ground state is not a spin and angular momentum singlet. 

In this study the focus lies on three issues: the performance of the standard IQG N- 
representability conditions in multireference situations, the quality of the variationally ob- 
tained 2DM, which we inspect by looking at different physical properties obtainable from the 
2DM. The third issue is the basis set dependence of the results, which we are able to check 
because we are not limited to small basis sets through the use of spin and angular momentum 
symmetry. The specific SDP algorithm used in this study is the simple dual-only potential 



reduction program explained in Section 3.4.2 



5.1.1 Spin and angular momentum constraints 

Because we know the ground-state spin and angular momentum of the systems studied, we 
can improve results by restricting our search to these symmetry sectors, as discussed in 
Chapter [4} This is done by imposing linear equality constraints, as explained in Section 
3.4.2.4[ the explicit form of which is derived in this Section. 
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5.1.1.1 Imposing the spin constraints for 5 = 

The spin-coupled form of the Sz operator can be written as: 



(5.1) 



which is a particIe-hoIe operator as defined in Eq. (4.38). This operator lives in particle-hole 
space, and we can force the vector 



{5.}f, - ^ 



Ssi^ab 



to be an eigenvector of G{T) with eigenvalue zero. This can be seen from the fact that 
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(5.2) 



(5.3) 



when the |^f^) are spin singlets. In doing this we automatically impose the same constraints 
on ^(r) for Sx and Sy due to the threefold degeneracy of the 5 = 1 block of the 2DM. It can 
be seen that in this case the expectation value of the total spin is zero, 
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So the condition to be imposed on the density matrix becomes: 
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(5.4) 



(5.5) 



We will call this necessary condition for T to be derivable from a singlet wave-function ensem- 
ble the projection of a two-particle density matrix on a spin-singlet state. It can be seen that 
there are as many constraint matrices as the dimension of IDM space. Because of the zero 
eigenvalues in the Q matrix the projected density matrix is on the edge of the feasible region 
during the whole of the minimization process, and as a result, the cost function is infinity. 
This is easily circumvented by taking the pseudo-inverse of the G matrix, which excludes the 
52-state from the inversion process. 

The linear constraints for imposing the spin-singlet condition are given by: 



V A: < / : Tr r Me) = 

in which the constraint matrices ^''^^E have the following form: 



(5.6) 
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(5.8) 
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5.1.1.2 Imposing the spin constraints for 5 7^ 



For higher-spin multiplets we use the spin-averaged ensemble (see Section 4.1.2), in which the 
2DM has the same simple structure as for the singlet case. The expectation value of the S"^ 
spin operator is forced to be exact, using the linear constraint 

TY rjcS^} = 5(5 + 1) , (5.9) 

where {S^} is the two-particle matrix representation of the S^ operator, 

'3 /2-iV' 



2tS 



{S^} 



ab:cd 



2 \N- 1 



+ S{S + 1) 



{SacSbd + {-irSadSbc) . (5.10) 



There is only one linear constraint for nonzero spin, in contrast to the numerous constraints 
for the projection on a singlet state. It can therefore be expected that the spin constraints 
{i.e. the constraints on the 2DM ensuring that it is derivable from a wave function with 
good total spin) are less accurate than those for the singlet case. It is, in fact, known how 
to cure this situation [69J by considering not the spin-averaged ensemble but rather the 2DM 
derived from the highest- weight member (Ai = S) of the multiplet. Similar to the spin-singlet 
projection, one can then impose the condition that, since the spin-raising ladder operator 5+ 
destroys the wave function, the G(r) matrix must have a zero eigenvalue (with an eigenvector 
in particle- hole space corresponding to the 5+ operator). In such a highest-weight scheme, the 
spin restrictions for the 5 7^ case are put on the same footing as for the singlet case; in fact, 
the highest-weight and the spin-averaged ensemble scheme are equivalent for the singlet case. 
However, the highest- weight scheme for 5 7^ requires one to keep track of more matrices and 
is computationally more demanding by about a factor of 10. We therefore used the ensemble 
scheme even for the nonsinglets (i.e. the Si atom), though we checked some cases with the 
highest-weight method for the spin. 

5.1.1.3 Spin and angular momentum projection 

Orbital angular momentum is an additional conserved quantum number in atomic systems. 
The same principles as in the previous Section can be used, but the implementation is a bit 
more complicated. It can be shown that in a spin and angular momentum coupled basis the 
z-projections of S and £ become 



£. 



1 ^^^ r + 1(0+1) 

-T^E^K^^H' , (5.11) 

nl 
^ ^r 1(1 + 0) 

Y.[l]l[<i^ani\ . (5.12) 



Following the same argument as before, it can be imposed that the density matrix is derivable 
from an eigenstate with zero eigenvalue of respectively the S and C operators when 

Eilc]g{TtZ^ = 0, (5.13) 

c 

T.i^c]icg {T^Z^ = 0. (5.14) 
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This can be translated into linear constraints on the 2DM, given by: 



y k<l 



Tr r 



[kl] 
J 



E 







(5.15) 



where the constraint matrices ^'^^^E have the following form: 



[kl] j^{L^S) 
J^ab;cd 



JJab;cd +^{-^) [ jJba;cd + [ jJba;dc +^[-^) [ jJab;dc 



[kl] AL^'S) 



S) 



+ 



[kl] fL-^S 
jJ cd;ab 



JJba;dc 

^( -\\S+L ([kl] fL^s\ j_ ([kl] fL''S\ , ^( -t^L+S ([kl] fL''S\ 
^\-^) y jJdc;abJ + y jJdcMj ~^^^~^) \ JJcd-Mj ' 

(5.16) 



where J^ can mean either 5 or C The parity vr has been introduced in Section |4.2[ The 
precise expression for the /-coefficients reads: 

1 [h] U 5 ^1 



[kl] AL-S) _. 
SJab;cd —"hh 



[kl] AL-s) _, 

£,Jab:cd —"hk 



2N 

N -I 



Sak^dhd 



1 



l-n ~^ l-h ~ L' 



^akSclhd ■ 



(5.17) 
(5.18) 



The projection on nonzero spin and angular momentum is a less strict condition, using the ap- 



propriate ensemble averaged over the third-component of C (see Section 4.2 ). The expectation 
values of C and S are projected on the desired values: 

Trr{52} = 5(5 + 1) , (5.19) 

Trr{£2} = £(£ + 1) , (5.20) 

where the {5^} and {>C^} are the two-particle matrix representations of the 5^ and C^ oper- 
ators respectively: 



ab\cd 



{5^ 



{C-]^'^ 



< ab\cd 



{&achd + <-lr^^5ad&bc) 
((^aAd + Vr(-l)^+^(^„Ac) 



\W-^\ ,sA 


1 

p 


(5.21 
(5.22 





5.1.2 Results and discussion 

Using the dual-only potential reduction algorithm, the isoelectronic series of Be, Ne and Si 
were calculated from the neutral atom up to a central charge Z = 28. Beryllium and Neon 
are both elements with a singlet ground state. In the Silicon ground state the total spin 
and angular momentum are both one, which allows us to assess the quality of the spin and 
angular momentum constraints for 5, £ 7^ 0. In order to study the basis set dependence, 
the properties of the ground state of the Be and Ne series were calculated in a cc-pVDZ, 
a cc-pVTZ and a cc-pVQZ basis set [M]. The Si series was only calculated in a cc-pVDZ 
and a cc-pVTZ basis set [70]. Spherical harmonic (and not Cartesian) basis functions are 
used throughout this Section. With the density matrices obtained from the v2DM, several 
properties were studied. These are compared to estimates for non-relativistic energies based 
on experimental data [7I1I72], and to the results of coupled-cluster (CCSD) calculations, and 
in some cases, to full-configuration-interaction (full-CI) calculations. 

The basis functions used were those of the neutral atom, but with a rescaling r — )• rZ/N 
for the positive ions with Z > N. The CCSD and full-CI results were obtained using the 
MOLPRO program [73j. 
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Table 5.1: Ground-state energies of the Be series in the cc-pVDZ basis set using different methods. 



z 


v2DM 


HF 


CCSD 


full-CI 


expt. 


4 


-14.617473 


-14.572338 


-14.617369 


-14.61741 


-14.66736 


5 


-24.275712 


-24.216056 


-24.27566 


-24.275684 


-24.34892 


6 


-36.387458 


-36.316267 


-36.387421 


-36.387439 


-36.53493 


7 


-50.940925 


-50.860695 


-50.940896 


-50.940909 


-51.22284 


8 


-67.931909 


-67.844323 


-67.931884 


-67.931896 


-68.41171 


9 


-87.358767 


-87.265015 


-87.358746 


-87.358755 


-88.10113 


10 


-109.22078 


-109.12175 


-109.22076 


-109.22077 


-110.29089 


11 


-133.51761 


-133.414 


-133.51759 


-133.5176 


-134.98088 


12 


-160.24908 


-160.14145 


-160.24906 


-160.24907 


-162.17102 


13 


-189.41511 


-189.30392 


-189.41509 


-189.4151 


-191.86127 


14 


-221.01564 


-220.90129 


-221.01563 


-221.01564 


-224.0516 


15 


-255.05067 


-254.93347 


-255.05066 


-255.05066 


-258.742 


16 


-291.52018 


-291.4004 


-291.52017 


-291.52017 


-295.93244 


17 


-330.42417 


-330.30206 


-330.42416 


-330.42416 


-335.62293 


18 


-371.76264 


-371.63841 


-371.76263 


-371.76263 


-377.81344 


19 


-415.5356 


-415.40942 


-415.53559 


-415.53559 


-422.50398 


20 


-461.74304 


-461.61508 


-461.74303 


-461.74304 


-469.69455 


21 


-510.38498 


-510.25537 


-510.38498 


-510.38498 


-519.38513 


22 


-561.46143 


-561.3303 


-561.46142 


-561.46142 


-571.57572 


23 


-614.97237 


-614.83983 


-614.97237 


-614.97237 


-626.26633 


24 


-670.91783 


-670.78398 


-670.91783 


-670.91783 


-683.45695 


25 


-729.29781 


-729.16274 


-729.2978 


-729.2978 


-743.14758 


26 


-790.1123 


-789.97609 


-790.11229 


-790.1123 


-805.33822 


27 


-853.36132 


-853.22404 


-853.36131 


-853.36131 


-870.02886 


28 


-919.04486 


-918.90659 


-919.04486 


-919.04486 


-937.21951 



5.1.2.1 Ground-state energy 

The ground-state energies, calculated with various methods and in the different basis sets, 



are shown in Tables 5.1, 5.2 and 5.3 for Beryllium, Tables 5.4, 5.5 and |5.6| for Neon and 



Tables 5.7 and 5.8 for the Silicon isoelectronic series. Even in the best case (Be in cc-pVQZ), 



the calculated energies are at least 20 niHartree removed from the experimental estimate in 
|71t [72] . This is due to the difficulty of describing the interelectronic cusp in the exact wave 
function using finite single-particle basis sets. 

More relevant is the difference between the v2DM (and CCSD) energies as compared to 
full-CI in the same basis set. This is shown in Figure [5^2] for the case of the Be series. Note 
that the CCSD energy is always above, the v2DM energy below, the full-CI energy For 
v2DM, this simply reflects the nature of the variational problem. For the smallest cc-pVDZ 
basis set, v2DM and CCSD have about the same level of accuracy. The difference with full-CI 
grows as the basis set size increases for both CCSD and v2DM, but this effect is worse for 
the v2DM. 

As far as the Z-dependence is concerned, the trend differs markedly for the cc-pV(D,T)Z 
and for the cc-pVQZ basis set. As Z increases there is a growing accuracy for the smaller basis 
sets in both CCSD and v2DM, whereas for cc-pVQZ the accuracy decreases for CCSD and 
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Table 5.2: The ground-state energies of the Be series in the cc-pVTZ basis set using different 
methods. 



z 


v2DM 


HF 


CCSD 


full-CI 


expt. 


4 


-14.625431 


-14.572873 


-14.623559 


-14.62381 


-14.66736 


5 


-24.300695 


-24.234557 


-24.299207 


-24.29943 


-24.34892 


6 


-36.473162 


-36.394215 


-36.471944 


-36.47214 


-36.53493 


7 


-51.137349 


-51.045734 


-51.136311 


-51.136486 


-51.22284 


8 


-68.290965 


-68.186797 


-68.290046 


-68.290206 


-68.41171 


9 


-87.932793 


-87.816285 


-87.931958 


-87.932107 


-88.10113 


10 


-110.06209 


-109.93353 


-110.06132 


-110.06146 


-110.29089 


11 


-134.67837 


-134.53811 


-134.67764 


-134.67778 


-134.98088 


12 


-161.7813 


-161.62969 


-161.78061 


-161.78074 


-162.17102 


13 


-191.37066 


-191.20808 


-191.37 


-191.37011 


-191.86127 


14 


-223.44627 


-223.27309 


-223.44563 


-223.44574 


-224.0516 


15 


-258.00801 


-257.82461 


-258.0074 


-258.00751 


-258.742 


16 


-295.05582 


-294.86255 


-295.05522 


-295.05533 


-295.93244 


17 


-334.58961 


-334.38682 


-334.58904 


-334.58913 


-335.62293 


18 


-376.60935 


-376.39737 


-376.60879 


-376.60888 


-377.81344 


19 


-421.115 


-420.89414 


-421.11445 


-421.11454 


-422.50398 


20 


-468.10654 


-467.87712 


-468.106 


-468.10609 


-469.69455 


21 


-517.58394 


-517.34625 


-517.58342 


-517.5835 


-519.38513 


22 


-569.5472 


-569.30152 


-569.54669 


-569.54677 


-571.57572 


23 


-623.99631 


-623.7429 


-623.99581 


-623.99589 


-626.26633 


24 


-680.93126 


-680.67038 


-680.93077 


-680.93084 


-683.45695 


25 


-740.35204 


-740.08394 


-740.35156 


-740.35163 


-743.14758 


26 


-802.25866 


-801.98356 


-802.25818 


-802.25825 


-805.33822 


27 


-866.65111 


-866.36925 


-866.65064 


-866.65071 


-870.02886 


28 


-933.5294 


-933.24098 


-933.52894 


-933.529 


-937.21951 
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Table 5.3: Ground-state energies of the Be series in the cc-pVQZ basis set using different methods. 



z 


v2DM 


HF 


CCSD 


full-CI 


expt. 


4 


-14.642807 


-14.572968 


-14.639589 


-14.640124 


-14.66736 


5 


-24.321254 


-24.236385 


-24.317643 


-24.31822 


-24.34892 


6 


-36.500934 


-36.40257 


-36.497178 


-36.497761 


-36.53493 


7 


-51.177145 


-51.065945 


-51.173335 


-51.173918 


-51.22284 


8 


-68.347448 


-68.22364 


-68.343621 


-68.344203 


-68.41171 


9 


-88.010503 


-87.87405 


-88.00667 


-88.007254 


-88.10113 


10 


-110.16555 


-110.01625 


-110.16171 


-110.1623 


-110.29089 


11 


-134.81213 


-134.64967 


-134.8083 


-134.80889 


-134.98088 


12 


-161.95 


-161.77398 


-161.94616 


-161.94677 


-162.17102 


13 


-191.57899 


-191.38895 


-191.57514 


-191.57575 


-191.86127 


14 


-223.699 


-223.49443 


-223.69514 


-223.69577 


-224.0516 


15 


-258.30998 


-258.09031 


-258.3061 


-258.30675 


-258.742 


16 


-295.4119 


-295.17653 


-295.40801 


-295.40867 


-295.93244 


17 


-335.00476 


-334.75303 


-335.00085 


-335.00153 


-335.62293 


18 


-377.08857 


-376.81977 


-377.08463 


-377.08533 


-377.81344 


19 


-421.66334 


-421.37673 


-421.65938 


-421.66011 


-422.50398 


20 


-468.72912 


-468.42388 


-468.72513 


-468.72588 


-469.69455 


21 


-518.28593 


-517.96121 


-518.28191 


-518.28269 


-519.38513 


22 


-570.33383 


-569.9887 


-570.32977 


-570.33058 


-571.57572 


23 


-624.87286 


-624.50634 


-624.86877 


-624.86961 


-626.26633 


24 


-681.90309 


-681.51414 


-681.89895 


-681.89983 


-683.45695 


25 


-741.42459 


-741.01206 


-741.4204 


-741.42132 


-743.14758 


26 


-803.43742 


-803.00013 


-803.43318 


-803.43414 


-805.33822 


27 


-867.94167 


-867.47832 


-867.93738 


-867.93838 


-870.02886 


28 


-934.93744 


-934.44663 


-934.93309 


-934.93413 


-937.21951 
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Table 5.4: The ground-state energies of the Ne series in the cc-pVDZ basis set using different 
methods. 



z 


v2DM 


HF 


CCSD 


full-CI 


expt. 


10 


-128.70843 


-128.48878 


-128.67964 


-128.68088 


-128.9376 


11 


-161.80049 


-161.59591 


-161.77283 


-161.77411 


-162.0659 


12 


-198.88784 


-198.70208 


-198.86199 


-198.86309 


-199.2204 


13 


-239.97194 


-239.80393 


-239.94802 


-239.94883 


-240.3914 


14 


-285.04223 


-284.88894 


-285.02004 


-285.02061 


-285.5738 


15 


-334.08381 


-333.94195 


-334.06299 


-334.06338 


-334.7642 


16 


-387.08194 


-386.94882 


-387.06219 


-387.06246 


-387.9608 


17 


-444.02427 


-443.89781 


-444.00531 


-444.00551 


-445.1622 


18 


-504.90101 


-504.77972 


-504.88268 


-504.88282 


-506.3673 


19 


-569.70474 


-569.58754 


-569.68689 


-569.68701 


-571.5754 


20 


-638.42981 


-638.31595 


-638.41239 


-638.41248 


-640.7891 


21 


-711.07205 


-710.96091 


-711.05497 


-711.05505 


-713.9988 


22 


-787.6282 


-787.51937 


-787.61143 


-787.6115 


-791.2132 


23 


-868.09589 


-867.98895 


-868.07932 


-868.07938 


-872.4291 


24 


-952.47304 


-952.36781 


-952.45674 


-952.45679 


-957.6463 


25 


-1040.7584 


-1040.6545 


-1040.7422 


-1040.7422 


-1046.8646 


26 


-1132.9505 


-1132.8479 


-1132.9345 


-1132.9345 


-1140.0838 


27 


-1229.0485 


-1228.9471 


-1229.0327 


-1229.0328 


-1237.3039 


28 


-1329.0518 


-1328.9514 


-1329.0361 


-1329.0361 


-1338.5247 



Table 5.5: Ground-state energies of the Ne series in the cc-pVTZ basis set using different methods. 



z 


v2DM 


HF 


CCSD 


expt. 


10 


-128.86088 


-128.53186 


-128.81081 


-128.9376 


11 


-161.97703 


-161.65496 


-161.92829 


-162.0659 


12 


-199.11372 


-198.79861 


-199.06598 


-199.2204 


13 


-240.26728 


-239.9582 


-240.22028 


-240.3914 


14 


-285.43166 


-285.12786 


-285.38525 


-285.5738 


15 


-334.6021 


-334.30313 


-334.55624 


-334.7642 


16 


-387.77553 


-387.48107 


-387.73017 


-387.9608 


17 


-444.95002 


-444.6598 


-444.9051 


-445.1622 


18 


-506.12426 


-505.83808 


-506.07977 


-506.3673 


19 


-571.29734 


-571.01502 


-571.25329 


-571.5754 


20 


-640.46864 


-640.18995 


-640.42496 


-640.7891 


21 


-713.63756 


-713.36231 


-713.59424 


-713.9988 


22 


-790.80365 


-790.53161 


-790.76063 


-791.2132 


23 


-871.96637 


-871.69743 


-871.9237 


-872.4291 


24 


-957.12544 


-956.85936 


-957.08305 


-957.6463 


25 


-1046.2804 


-1046.0171 


-1046.2383 


-1046.8646 


26 


-1139.431 


-1139.1702 


-1139.3892 


-1140.0838 


27 


-1236.5769 


-1236.3184 


-1236.5353 


-1237.3039 


28 


-1337.7178 


-1337.4616 


-1337.6764 


-1338.5247 
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Table 5.6: The ground-state energies of the Ne series in the cc-pVQZ basis set using different 
methods. 



z 


v2DM 


HF 


CCSD 


expt. 


10 


-128.92686 


-128.54347 


-128.87106 


-128.9376 


11 


-162.05038 


-161.67155 


-161.99595 


-162.0659 


12 


-199.19913 


-198.82303 


-199.14502 


-199.2204 


13 


-240.36392 


-239.98965 


-240.30977 


-240.3914 


14 


-285.53886 


-285.16605 


-285.48453 


-285.5738 


15 


-334.72067 


-334.3492 


-334.66615 


-334.7642 


16 


-387.90757 


-387.53731 


-387.85281 


-387.9608 


17 


-445.09826 


-444.72921 


-445.04331 


-445.1622 


18 


-506.29194 


-505.92402 


-506.23681 


-506.3673 


19 


-571.48788 


-571.12105 


-571.43261 


-571.5754 


20 


-640.68561 


-640.31973 


-640.63013 


-640.7891 


21 


-713.88444 


-713.51958 


-713.8289 


-713.9988 


22 


-791.08414 


-790.72018 


-791.02848 


-791.2132 


23 


-872.28418 


-871.92117 


-872.22853 


-872.4291 


24 


-957.48456 


-957.12224 


-957.42872 


-957.6463 


25 


-1046.6846 


-1046.3231 


-1046.6288 


-1046.8646 


26 


-1139.8845 


-1139.5236 


-1139.8285 


-1140.0838 


27 


-1237.0837 


-1236.7235 


-1237.0277 


-1237.3039 


28 


-1338.2821 


-1337.9226 


-1338.2261 


-1338.5247 



Table 5.7: Ground-state energies of the Si series in the cc-pVDZ basis sets using different methods. 
The resuhs under v2DM were calculated using the ensemble avaraged spin projection, those under 
v2DM* were calculated using the maximal weight method. 





v2DM 


v2DM* 


HF 


CCSD 


expt. 


14 


-288.93962 


-288.92921 


-288.84644 


-288.91895 


-289.359 


15 


-340.36765 




-340.27338 


-340.34709 


-340.872 


16 


-396.10801 




-396.01679 


-396.08749 


-396.869 


17 


-456.09635 




-456.00926 


-456.0759 


-457.337 


18 


-520.29362 


-520.27860 


-520.21067 


-520.27348 


-522.269 


19 


-588.68067 




-588.60149 


-588.66097 


-591.66 


20 


-661.24791 




-661.17202 


-661.22871 


-665.507 


21 


-737.99017 




-737.91714 


-737.97148 


-743.808 


22 


-818.90446 


-818.88808 


-818.83388 


-818.88621 


-826.559 


23 


-903.98889 




-903.92042 


-903.97105 


-913.762 


24 


-993.24222 




-993.1756 


-993.22475 


-1005.413 


25 


-1086.6636 




-1086.5986 


-1086.6465 


-1101.513 


26 


-1184.2525 


-1184.2381 


-1184.1889 


-1184.2357 


-1202.061 


27 


-1286.0084 




-1285.9461 


-1285.9919 


-1307.057 


28 


-1391.9311 




-1391.8699 


-1391.9147 


-1416.5 
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Table 5.8: The ground-state energies of the Si series in the cc-pVTZ basis sets using different 
methods. The results under v2DM were calculated using the ensemble avaraged spin projection. 





v2DM 


HF 


CCSD 


expt. 


14 


-289.02515 


-288.85215 


-288.9835 


-289.359 


15 


-340.50472 


-340.33467 


-340.46205 


-340.872 


16 


-396.43974 


-396.27384 


-396.39711 


-396.869 


17 


-456.80372 


-456.64236 


-456.7617 


-457.337 


18 


-521.58 


-521.42294 


-521.53858 


-522.269 


19 


-590.75683 


-590.60373 


-590.7159 


-591.66 


20 


-664.32396 


-664.17433 


-664.28328 


-665.507 


21 


-742.2714 


-742.12467 


-742.23072 


-743.808 


22 


-824.58949 


-824.44532 


-824.54882 


-826.559 


23 


-911.27007 


-911.12778 


-911.22912 


-913.762 


24 


-1002.3054 


-1002.1648 


-1002.2643 


-1005.413 


25 


-1097.6895 


-1097.5501 


-1097.6482 


-1101.513 


26 


-1197.4173 


-1197.2789 


-1197.3759 


-1202.061 


27 


-1301.4847 


-1301.347 


-1301.4431 


-1307.057 


28 


-1409.8882 


-1409.7512 


-1409.8466 


-1416.5 
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Figure 5.2: Difference between approximate (CCSD or v2DM) and fuU-CI energies for the Be 
series in all three basis sets. 
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becomes constant for v2DM. The reason for this difference is not clear, though it is probably 
connected to the incipient degeneracy of the 2s and 2p states and the quality of its description 
in the various basis sets, as is more fully explained in the next Section. It should be noted 
that the v2DM results are overall very accurate, even in the worst case {Z = 28, cc-pVQZ) 
differing less than 3 mHartree from full-CI. 

For the Ne series, full-CI calculations were only possible in the cc-pVDZ basis. From the 



results collected in Tables 5.4, 5.5 and 5.6 it is seen that the v2DM accuracy is significantly 
less than for Be, the largest deviation to full-CI (28 mHartree) appearing for the neutral 
atom. This is actually comparable to v2DM results for molecules under IQQ conditions, so 
it is likely that because of the small number of electrons the Be results are an exceptionally 



favorable case. This is also borne out by the Si results in Tables 5.7 and |5.8[ showing a 



maximal deviation between CCSD and v2DM energies of 21 mHartree for the neutral atom. 

5.1.2.2 Correlation energy 

Correlation energies were calculated by taking the difference of v2DM or CCSD energies with 
the Hartree-Fock results in the same basis set. The results labeled "experimental" are the 
estimates in 



Beryllium series: in Fig. 5.3 the v2DM correlation energy is shown as a function of central 
charge Z for the different basis sets. Note that on the plot the difference between the CCSD 
and full-CI correlation energies would not be visible. The experimental curve is linear in Z, 
as a direct consequence of the near-degeneracy of the ground state [72] . One can calculate 
a perturbative series expansion of the exact and Hartree-Fock energy in powers of -^; the 
corresponding series for the correlation energy starts with a constant if the hydrogenic ground 
state is nondegenerate, or with a linear term in Z in case of degeneracy. The v2DM correlation 
energy does not follow this trend: it goes linear in the beginning, but becomes concave in 
the cc-pVDZ and cc-pVTZ basis, or convex in the cc-pVQZ basis. This failure, however, is 
not related to the v2DM method as the trend is the same in full-CI. It simply reflects the 
fact that the incipient degeneracy is not well described in these basis sets. This can also be 
seen by calculating the Z = 1 hydrogen spectrum (corresponding to the Z — t- oo situation, 
when the electron-electron interaction can be neglected) in the basis sets: the 2s and 2p 
energies are not degenerate, but differ by 5.8 mHartree (cc-pVDZ), 2.0 mHartree (cc-pVTZ) 
and -2.3 mHartree (cc-pVQZ). Note that for cc-pVQZ the 2p energy actually drops below 
the 2s energy, explaining the different (convex/ concave) behavior of the curves. To make 
sure we also performed calculations in the cc-pVDZ basis after rescaling (r — )• ar) it in such 
a way that the hydrogenic 2s-2p degeneracy is exact. In this basis the v2DM correlation 



energy (also shown in Fig. 5.3) indeed has the correct linear behavior. It is clear from the 
above discussion that v2DM is indeed capable of providing accurate correlation energies in 
the presence of near-degeneracies, when other many-body techniques (like density functional 
theory or MP 2) can fail. 



Neon series: in Fig. 5.4 the correlation energy is shown for all three basis sets as a function 
of Z. Because Ne is a closed shell atom, there is no near-degeneracy for large Z values and 
the exact correlation should be asymptotically constant in Z, as is indeed visible in the 
experimental curve. Due to basis set effects, this constant behavior is imperfectly realized, 
but the v2DM follows the same trends as CCSD for all basis sets. Note that the approximation 
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Figure 5.3: v2DM correlation energy for the Be series in all three basis sets, and in a rescalcd basis 
set that exhibits hydrogen-like behaviour (degeneracy between the 2s and 2p level). For comparison, 
the CCSD and experimental values are also shown. Note that for the cc-pVDZ basis, CCSD and 
v2DM results coincide. 
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Figure 5.4: v2DM correlation energy for the Ne series in all three basis sets. For comparison, the 
CCSD and experimental values are also shown. 
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Figure 5.5: v2DM correlation energy for the Si series in the cc-pVDZ and cc-pVTZ basis set. For 
comparison, the CCSD and experimental values are also shown. 



to a constant behavior at large Z is best for the largest basis set. The decrease in correlation 
energy for increasing Z, in contrast to the slight rise in the experimental correlation energy, 
can be attributed to the fact that the basis sets were optimized for the neutral atom. While 
the rescaling procedure fixes the nuclear cusp, the resulting basis set is obviously far from 
optimal for highly charged ions. 



Silicon series: for silicon, only the cc-pVDZ and cc-pVTZ basis have been used [Fig. 5.5 



As was the case for Be, the theoretical linear rise with Z is thwarted by imperfections in 
the basis sets. However the v2DM correlation energy closely tracks the CCSD one. The Si 



ground state is a spin triplet. The results in Tables 5.7 and 5.8 have been obtained using 



the spin-averaged ensemble, as explained in Section 4.1.2 In order to assess the quality of 



the spin constraints, we have also performed calculations using the highest-weight method, 
for Z=14, 18, 22, and 26 with the cc-pVDZ basis set, the resulting energies are also reported 
The energy differences between the approaches are sizeable, with differences 



in Table 5.7 



as large as 20 mHartree, reflecting the weaker nature of the spin constraints imposed in the 
spin-averaged scheme. However, the discrepancy between the two approaches is stable for 
increasing Z. 
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Figure 5.6: Ionization energy scaled with -^, for the Be, Ne and Si series in the different basis sets 
compared with experimental results. 
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Figure 5.7: Single-particle levels obtained in a correlated Hartree-Fock-like scheme (see text) for 
the Be series in a cc-pVDZ basis set. 



5.1.2.3 Ionization energies 

It is important to gauge the quality of the 2DM by analyzing other properties than just 
the energy, e.g. the ionization energies of the different atomic ions, which can be easily 
calculated using the extended Koopmans' theorem (EKT) |74 1 175 1 ^76]- The EKT provides 
a single-particle picture of the ground state, with single-particle energies and spectroscopic 



factors. The ionization energies are shown in Figure 5.6 the agreement between calculated 
and experimental values is very good, pointing to the realistic nature of the variationally 
obtained 2DM. The good agreement with experiment reflects the fact that the error in the 
description of the interelectronic cusp largely cancels since the ionization energy is an energy 
difference. For Be and Ne it is clear that the basis set limit is nearly reached at the cc-pVTZ 
- cc-pVQZ level. Even for Si the experimental ionization energy is closely reproduced. 



5.1.2.4 Correlated Hartree-Fock-like single-particle energies 

Another single-particle picture is given by the correlated Hartree-Fock-like single-particle 
orbitals and energies. These are constructed by diagonalizing the single-particle Hamiltonian: 



ha-r = {T + U)^^ + ^ VaP'-ySPfiS 



(5.23) 



116 



CHAPTERS. APPLICATIONS 



0.12 
0.11 
0.1 
0.09 
0.08 
.9 0.07 

Q. 

B 0.06 
o 

0.05 
0.04 
0.03 
0.02 
0.01 



l-2s A 
2p V 



* QZ 



^ « ♦ * 



10 



15 



20 



25 



Figure 5.8: Natural occupation of the 2p-orbital and one minus the occupation of 2s-orbital for the 
Be series, in all three basis sets. 



where the IDM p appearing in Eq. (5.23) is constructed from the variationally determined 



2DM. As an example of this method, the single-particle energies for the isoelectronic series 



of Be in a cc-pVDZ basis are shown in Figure 5.7 Notice that when Z increases, the energy 
levels approach those of the hydrogen atom. Similar behavior is present for the other basis 
sets and for the Ne and Si isoelectronic series. 



5.1.2.5 Natural occupations 



The eigenvalues of the IDM (i.e. the natural orbital occupation numbers) provide additional 
insight into the quality of the electron correlation effects included in v2DM. The occupation 
numbers from v2DM are always very close to those from full-CI, differing by at most 0.005. Of 
particular interest are the occupations of the quasi-degenerate 2s and 2p orbitals in Be. These 
are shown in Fig. |5.8[ The sum of the 2s and 2p occupations is nearly 1 and increasingly so 
for large Z. This implies that only the 2s and 2p are partially occupied in the large-Z limit. 
The shapes of the curves reflect the aforementioned imperfections in the basis sets, with the 
2s below the 2p for cc-pVDZ and cc-pVTZ, and above the 2p for cc-pVQZ. 
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Figure 5.9: Energy versus electron number for the Nitrogen atom. One can see that the v2DM 
method resuhs in a smooth convex curve, while the exact result is piecewise linear. 

5.2 Dissociation of diatomic molecules 

A diatomic molecule, consisting of N electrons spread out over two atoms, A with charge Za 
on Ryi and B with charge Zb on R^, is described by the Hamiltonian: 



H 



N 

E 



-yu 



Za 



R. 



+ 



Zb 



R 



B 



N 



+E 

Kj 



(5.24) 



One speaks of dissociation, or bond stretching, when the distance between the two bonded 
atoms |Ryi— Rb| is increased from its equilibrium value, and the dissociation limit corresponds 
to the situation where there is no interaction left between the atoms. Dissociation of molecules 
is very important in chemistry, because a good understanding of it is essential to understand 
why chemical reactions (which involve the breaking of bonds and the formation of new bonds) 
take place. Many electronic structure methods have great difficulty describing the dissociation 
limit, however, because when a molecule is pulled out of equilibrium, multireference effects 
occur. 



5.2.1 v2DM dissociates molecules into fractionally charged atoms 

To test how well the IQG conditions describe molecular dissociation the 14-electron NO'*' 
molecule was studied. From the ionization energies one can predict that the dissociation 
products should be N and O'^. Rather surprisingly the v2DM result was to divide the charge 
between the N and the O, assigning 6.53 electrons to N and 7.47 electrons to O. We also 
found that the energy was far too low. The same failure occurs in DFT, where the problem 
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Figure 5.10: Energy versus electron number for the Oxygen atom (See caption Figure 5.9) 



results from the wrong behaviour of the energy as a function of the number of electrons 
[TZIITEIESIISOIEIIES]. It turns out that in v2DM the same error lies at the basis of the faulty 



dissociation limit. As can be seen in Fig. 5.9 for N and Fig. 5.10 for O, the E vs. N curve is 
convex, while the exact curve is piecewise linear between the integer occupations. This can 



be understood from the discussion in Section 2.3.1 , where it was shown that the correct 2DM 



description of a system with fractional electron number is through an ensemble of integer- A^ 



2DM's. The results in Figs. 5.9 and 5.10 however, were produced in a naive approach by 
considering A as a continuous parameter in the integer N v2DM procedure as discussed in 
Chapterjsj This naive approach is relevant, however, because it is exactly what happens when 
considering the individual atoms as subsystems of an integer A^ molecule, using approximate 
A^-representability conditions, as explained in Section 2.3.1.3 From Fig. |5.11 one can see 
how this convex curve results in the fractionally occupied dissociation products. In this figure, 
the sum of the energies of N and O are plotted, for fractional occupations which sum to 14 
electrons. One can see that the lowest energy is obtained from the fractional occupation 
of Oxygen of 7.47, which is exactly the result we get from the molecular calculation in the 
dissociation limit! It is worth noting that the same behaviour is present when adding the T 
conditions, and as such adding them does not fix the errors in the dissociation limit. 



5.2.2 Subsystem constraints cure the dissociation limit 



In this Section we show that the result of Section 12.3.11 can be used to fix the dissociation 
problem in v2DM. From Theorem [3] we know that if a 2DM describing a molecule is A- 
representable, then the 2DM's of the the atomic parts of the molecules are fractional A- 
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Figure 5.11: Energy versus electron number for the sum of the Nitrogen and Oxygen energies. N is 
the number of electrons on the Oxygen atom. The v2DM method clearly has a minimum at 
fractional N. 



representable, and must obey the inequalities: 



TV/,V + TrrV^>i?o^fi^^ 



(5.25) 



for every two-particle Hamiltonian H defined on the atomic subspace V. From Figs. 



5.9 



and 5.10 we can see that these constraints are grossly violated when we choose the atomic 
Hamiltonian for the subsystem Hamiltonian H , where the exact fractional- A'^ energy Eq is 



the piecewise linear curve. If the inequalities (5.25) for the atomic subsystems in the molecule 



are enforced, the E vs. A^ curve automatically becomes piecewise linear, and the dissociation 
of the molecule results in integer occupied fragments. 



5.2.2.1 Implementation 

In this Section we discuss how to implement the subsystem constraints in the v2DM ap- 
proach. The procedure for applying the atom-A subsystem constraint in a diatomic AB can 
be summarized as follows: 

1. Solve the atomic v2DM problem for a central charge Z^ at various electron numbers, 
using the single-particle orbitals centered on A. In practice, only electron numbers near 
atomic neutrality are important. This generates the atomic energy E^ as a function of 
fractional electron number N (i.e. a piecewise linear curve as in Figs. 5.9 and 5.10). 



2. For each internuclear distance Rab, calculate the transformation matrix between the 
(nonorthogonal) atomic basis of the A-centered orbitals {ia), and the orthonormal 
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molecular basis |a) that is used in the SDP program, 

X^, = (u|a) . (5.26) 

The X matrix is easily constructed with standard quantities in molecular modelling 
packages, 

^i,a = /_^Cajri^iA\iD 1 (5.27) 

in 

with C the expansion coefficients of the \a) molecular basis in terms of all the nonorthog- 
onal orbitals centered on the various atoms. 



\a) = 

3d 



Y.^a,^n\3D) ^ (5-28) 



and with Si^^-jj^ = {iaUd) the overlap matrix for the atom-centered basis functions. 
The orthogonal projector on the subspace spanned by the A-centered orbitals, when 
expressed in terms of the nonorthogonal basis set, reads [83] 

^^ = E(^A')uKvi)(jA|, (5.29) 



where Sa is the block of the overlap matrix corresponding to the A-centered orbitals, 
and S^ is the inverse of this block. 

3. Perform the molecular v2DM calculation with the extra linear inequality: 

Tr (tV) +Tr {V^T^) > i^^=Tr(i^p") (530) 



Here: 

(i^)a7 = T.{iA\i^\JA)W,^Wf^ , (5.31) 

(iX-r = T^iSAhW^^Wf^ , (5.32) 

{V^U;^S = Y.^iAJA\V^\kAlA) .WtjVfpW^^WiJ , (5.33) 

ijkl 

and tA is the kinetic energy plus attraction to nucleus A. The coefficients W are given 
by: 



The inequality (5.30) is nothing but the application of Eq. (5.25) in the subspace defined 
by the single-particle orbitals centered on A and using the Hamiltonian of atom A. 
Obviously, atom B generates a similar inequality. 
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Table 5.9: Difference (in niHartree) between full-CI energy and variationally optimized 2DM energy 
without (2DM) and with (2DM+) subspace constraints. 
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26.95 


1.26 
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27.90 


0.66 


2.75 


11.30 


11.30 


8.00 


28.88 


0.53 


3.00 


12.00 


12.00 


8.50 


29.63 


0.38 


3.25 


12.56 


12.56 


9.00 


30.39 


0.37 



5.2.2.2 Numerical verification 

In order to show the value of the subsystem constraints, we present the potential energy 
surface of BeB^, computed for a separation ranging from 1 to 9 A. The main interest here is 
a proof-of-principle of the fact that the new constraints indeed severely restrict the variational 
freedom in the v2DM. We therefore opted for the fairly small Dunning-Hay basis |84| . making a 
comparison to full-CI calculations still feasible. BeB"*" is a good example since this 8 electron 
system dissociates into Be and B"*". Application of the I, Q and Q conditions for an 8 
electron system is expected to yield energies that are significantly too low compared to full- 
CI. However, at the dissociation limit, the energy of the molecule should be equivalent to that 
of isolated Be and B"*" . As shown in [66] , for both Be and B"*" the IQQ energy is very nearly 
equal to the full-CI energy, so application of the subspace constraints should result in much 
higher IQQ energies. 



Table 5.9 contains the energy of the molecule at different internuclear distances, computed 
at the full-CI level of theory as well as the variationally optimized 2DM energy with and 



without subspace constraints (see also Fig. 5.12) 



Table 5.9 very clearly shows that, at larger separation, the difference between the full-CI 
and 2DM energies is substantial when using only the X, Q and G constraints, growing as large 
as 30 mHartree. The subspace constraints succeed in reducing this error by approximately 
two orders of magnitude. As expected, the remaining error is very small because IQQ yield 
energies for the atomic 4-electron isoelectronic series that are very near to full-CI energies. 
The present new constraints are clearly very succesful. As Figure [5. 12 shows, the constraints 
are active most for separations above 4.5 A. The nearer to complete dissociation, the more 
of the error is recovered by the subspace constraints. As shown by van Aggelen et al. [68], 
not only are the energies improved, also chemical observables and chemical concepts are 
substantially better for the 2DM obtained when including the subspace constraints. As an 
example, the Mulliken population jBl] on the Be atom at 9 A is +0.38 when not using the 
subspace constraints, whereas inclusion of the subspace constraints yields a charge of 0.00, 
consistent with reality. The added constraints result in a much better description of molecular 
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Figure 5.12: Singlet dissociation energy curve of BeB^ calculated in fuU-CI, and determined 
variationally without (2DM) and with (2DM-f ) subspace constraints. 



dissociation. Neither atom still suffers from fractional occupancy at the dissociation limit. 
Addition of each subsystem constraint does not slow down the v2DM, as it adds a fairly 
simple linear inequality constraint. 



5.2.2.3 Extension to polyatomic systems 

For polyatomic systems, the addition of correct subsystem constraints is much less straightfor- 
ward than in the diatomic case. The subsystems are no longer only atomic systems, but can 
also be polyatomic systems, for which the constraints become dependent upon the geometry 
of the subsystems, i.e. the interatomic distance. The number of possible subspaces, grows 
rapidly with the number of atoms present in the molecule. It is unlikely that all of these 
constraints are needed and only a few will probably be violated when not included. A de- 
tailed study [86] was made by van Aggelen et al. of the performance of subsystem constraints 



in the triatomic molecular anion F^. It was found that for geometries with either clearly 
dissociated or short bonds, the correct dissociation can be obtained using only constraints on 
the spatially separated units in the system. E.g. when the three F atoms are so far apart that 
they don't interact anymore, subsystem constraints for the atomic parts are the only ones 
useful, or for the case where a diatomic molecule is clearly separated from a single atom, the 
correct constraints are the geometry-dependent diatomic one, and an atomic one. However, 
in situations which are not so clear-cut, additional constraints may become active. 
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5.2.2.4 Generalization of the atomic subspace constraints 

The current formulation of the subsystem contraints on diatomic systems has the drawback 
that they only become active at large distances, and do not improve the energy closer to 
equilibrium. As a consequence there is a non-parallellity of the v2DM potential energy surface 



compared to the exact one, which is clearly visible in Fig. 5.12 Ideas to improve on this exist 
but haven't been tried yet. One approach would be to modify the atomic Hamiltonian by 
including effects from its environment. This would result in an iterative optimization of the 
2DM, where the 2DM from the previous molecular calculation is used to create the subspace 
Hamiltonian for the constraint in the next iteration. The environment would appear only 
in the single-particle part of the subspace Hamiltonian corresponding to atom A, by adding 
to the Hamiltonian of atom A the nuclear attraction to atom B as well as the Hartree-Fock 
potential generate by the current iterative approximation for F. 

5.3 The one-dimensional Hubbard model 



The one-dimensional Hubbard model, as introduced in Section 4.3 is described by the Hamil- 
tonian: 

H = -tYj («U"»+i;'^ + al+i;aOi;'x) +^Y1 4^^n4iaii ■ (5-35) 

ia i 

This Hamiltonian is the simplest model describing the non-trivial correlations in a solid state 
lattice as a competition between the delocalizing hopping term and the local on-site interac- 
tion. In this Section we present and discuss the results of v2DM calculations, taking advantage 



of all the symmetries as described in Section 4.3 on a 50-site lattice with the IQG conditions. 



and on a 20-site lattice with the XQQT1T2 conditions. The Hubbard model has been stud- 
ied before using the v2DM method, see e.g. [871 HOI EH], but up to now only the half-filled 
lattice was considered. In this study we consider different filling factors, and extract various 
properties like the ground-state energy, two-particle correlation functions and the momentum 
distribution in order to assess the quality of the variationally obtained 2DM. 

5.3.1 Results 

The v2DM results discussed in this Section were all obtained using the primal-dual predictor 



corrector semidefinite programming algorithm described in Section 3.4.3 Although the one- 
dimensional Hubbard model can be solved exactly using the Bethe ansatz ^88l ESJ [901 [HI] , 
it is hard to extract information about the solution for finite systems. For the calculations 
on a 20-site lattice, we compare the data with the quasi-exact results obtained through a 
variational Matrix Product State (MPS) algorithm [5l[6l[92], written by co-worker Sebastian 
Wouters ^93j. For the 50-site lattice, however, this is no longer computationally feasible. At 
half filling a simplification in the Bethe-ansatz equations occurs, which allows to calculate the 
ground-state energy of finite systems by solving a set of non-linear equations (Lieb-Wu) [ 
At other fillings no data is available for comparison. 

5.3.1.1 Ground-state energy 



In Fig. 5.13 the ground-state energy per particle of the one-dimensional Hubbard model is 



plotted as a function of the on-site repulsion U (the hopping parameter t will always be 
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Figure 5.13: Ground-state energy per particle as a function of on-site repulsion U of the Hubbard 
model for a 20-site (top) and 50-site (bottom) lattice at half-, jj^ and ^ filling. For the 20-site 
lattice a comparison is made between v2DM using the IQQ and TQQT conditions, and a quasi-exact 
result using MPS. For the 50-site lattice only XQQ conditions are feasible, and these have been 
compared to the exact (Bethe-ansatz) result for half filling. 
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Figure 5.14: Ground-state energy per particle as a function of the lattice size L, indicating how 
fast the finite size results converge to the thermodynamic limit for half filling. 



taken equal to unity). In the top figure the v2DM results for the 20-site lattice are shown 
for three different fillings, 12 particles (^), 16 particles (A) and half filling. These were 
calculated using both the IQG and the XQQT conditions, and are compared to the quasi- 
exact variational MPS results. In the bottom figure the v2DM results for the 50-site lattice 
are shown for the same fillings (i.e. 30 particles {jn), 40 particles (jq) and half filling) . For the 
50-site lattice it was only possible to perform the calculations using the TQQ conditions, and 
compare to the exact solution obtained by solving the Lieb-Wu equations for the half-filled 
lattice [9i] . 

One interesting thing to notice is that the TQQ energy per particle for the 20-site lattice 
and the 50-site lattice, at the same filling, are very similar. This is due to the periodic 
boundary conditions which make the results converge quite rapidly for increasing lattice size 
L, implying that one can already extract relevant results for the thermodynamic limit by 
studying relatively small lattices. This fast convergence can be clearly seen in Fig. 5.14[ 
where we plotted the energy per particle of a Hubbard model with [/ = 1 at half filling, as a 
function of the lattice size L. 



Another thing to remark in Fig. 5.13 is that for the 20-site lattice, the difference between 



XQQ and XQQT is rather small for the half-filled lattice, but larger for the other fillings, 
and that the difference gets larger when U increases. For the 50-site lattice we see that the 
TQQ result agrees nicely with the solution of the Lieb-Wu equations. For the other fillings 
no reference data are available. There are, however, two limits of the model that are exactly 
solvable. The first limit is the case of no interaction, i.e. U = 0, for which the solution has 



already been given in Section |4.3| (see Fig. 4.2). The Hamiltonian reduces to a single-particle 
operator, which means this limit is already described correctly by including the X and Q 
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Table 5.10: Energy per site of the v2DM calculations away from half filling at large values of U, 
compared to the MPS and the Bethe-ansatz results where available. 
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exact 
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oo 
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conditions alone. The other exactly solvable limit is when U — )• -|-oo. In this limit the physics 
of the model decouples into two independent parts, one describing the spin of the system, 
and the other the movement of the particles (this is called spin-charge separation |95j). This 
decoupling shows up in the Bethe-ansatz wave function: the charge degrees of freedom are 
described by a Slater determinant of spinless fermions, whereas the spin degrees of freedom 
become equivalent to a spin-g Heisenberg model. The single-particle energy spectrum changes 



slightly compared to Eq. (4.97) because the boundary conditions for spinless fermions are 
periodic/antiperiodic if A'' is even/odd [95l |96] : 



Cfc 



-2tcosk where 



k 



k 



2iTn 

~ L 
(2n+l)7r 

L 



if N%2 = 
if N%2 = 1 



(5.36) 



When the lattice is half-filled all the single-particle states are occupied, and the total energy 



sums up to zero, which is correctly described by the IQG results in Fig. 5.13 Away from 



half filling, however, the energy has a finite limit which can be calculated using Eq. (5.36). 



From the figure we can see that the IQQ conditions do not suffice to correctly describe the 
large-C/ limit. Only when the T conditions are added, the results converge to the right limit. 
Calculations at very large values of U have been performed that confirm this statement, and 
these results are shown in Table [5T0l 



5.3.1.2 Momentum distribution 



The momentum distribution function is easy to extract from the 2DM, as it is just the 
IDM expressed in momentum space (See Eq. (4.101|)). The results of the v2DM calculations 



for jK and tt, fillings using both IQQ and XQQT calculations are presented in Figs. 



5.15 



and 5.16 for three different values of the on-site repulsion U. In case of XQ^, the values 
of the momentum distribution for the same filling factor but different lattice sizes L = 20 
and L = 50 lie very nearly on the same curve, and are therefore plotted together. Before 
the results are discussed we note that the one-dimensional Hubbard model is not a Fermi 
liquid, but a Luttinger liquid [971 EH |99]. The momentum distribution therefore has no 
discontinuity at the Fermi level kp in the thermodynamic limit. For finite systems, however, 
one has only discrete momentum values so one has to perform finite size scaling to see whether 
Luttinger or Fermi liquid behaviour appears. Since we only performed calculations at two 
systems sizes, finite size scaling is not possible. Nevertheless, conclusions about the quality 
of the variationally obtained 2DM can be extracted by comparing our results to numerical 
calculations in the weak-correlation limit using Quantum Monte Carlo (QMC) |100j . and 
in the strong-correlation limit by solving the simplified Bethe-ansatz |95j. The agreement 
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Figure 5.15: Momentum distribution in the ground state for a lattice filling of ^ and on-site 
repulsion U = 1,8 and 100 calculated with the IQG conditions (top) and the TQQT conditions 
(bottom). 
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Figure 5.16: Momentum distribution in the ground state for a lattice filling of j^ and on-site 
repulsion U = 1,8 and 100 calculated with the IQG conditions (top) and the TQQT conditions 
(bottom). 
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between the XQQ and XQQT results is quite good for the U = 1 curve, both for the ^ and 
jg fiUed lattice. These momentum distributions have the form one would expect for small 
values of U, i.e. a Fermi-sea type distribution with highly occupied low momentum states and 
with a steep drop around the Fermi momentum kp. As U increases the distribution becomes 
more spread out, and we note that the agreement between IQG and ZQQT deteriorates. The 
IQGT results tend to move away more from the Fermi-sea behaviour than the IQG results, 
which is in agreement with the earlier observations for the energy. 

For large values of U one would expect a discontinuity around 2kF, because of the spinless 
fermion description of the charge part of system. This jump is absent, in agreement with exact 
results [95j, which shows that the behaviour in the strong-correlation limit is more subtle. 
For jq filling we notice that the momentum distribution has a non-monotonous behaviour: 
the occupation first drops and then rises for higher momenta in the XQQT description of 
the large-C/ systems. This behaviour is absent in the TQQ description. This rise is physical 
and theoretically understood from the Bethe-ansatz solution at infinite U |95| : its absence 
in the 2DM determined with XQQ is another indication of the fact that the two-index XQQ 
conditions are not sufficient to capture the physics in the strong-correlation limit. 

5.3.1.3 Correlation functions 

Two-particle correlation functions are important quantities in the analysis of lattice systems, 
because they usually display the physics present in the system (for instance the appearance 
of magnetism). In this Section we show that in our approach, these correlation functions are 
easily extracted from the 2DM, and compare our results to those in [100^ 195). 



Charge correlation The two-particle charge correlation function is defined as: 

C{r) = {hjflj+r) = J2^4'^"'^''"']+r;a'^J+r;<r') ' 



(5.37) 



in which the notation (.) denotes the expectation value. The function is independent of the 
specific choice of the index j because of the periodic boundary conditions. The expression in 



Eq. (5.37) can be written in terms of the ^(F) matrix: 

C{r) = 2_^G{T)jaja;{j+r)cT'{j+r)cr' ) 



(5.38) 



and in fact only the singlet part of the G matrix appears: 

C{r) = 2 G{^)jj-{j+r){j+r) 



(5.39) 



In translationally invariant systems one usually takes the Fourier transform of the correlation 
function, 

C{k) = Y: e^''C{r) = 2 ^ j; Q {'^)tk,,k.u, ■ (5-40) 

C{k) has been plotted for 



In Figs. 



5.17 



5.18 



and 



5.19 



10' 10 



and half filling respectively, 
using both XQQ and XQQT conditions. Comparing the XQQ with the XQQT results the 
same trends can be noticed as for the energy and the momentum distributions. For half 



filling (Fig. 5.19) the XQQ and XQQT results are in nice agreement. Moving away from half- 



filling (Figs. 5.17 and 5.18) there is only agreement for small values of U . For larger values 
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Figure 5.17: Two-particle charge correlation function C{k), as a function of momentum, for a ^ 
filled lattice and various values of on-site repulsion U, using IQQ (top) and TQQT (bottom) 
conditions. 
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Figure 5.18: Two-particle charge correlation function C{k), as a function of momentum, for a ^ 
filled lattice and various values of on-site repulsion U, using IQQ (top) and TQQT (bottom) 
conditions. 
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Figure 5.19: Two-particle charge correlation function C{k), as a function of momentum, for a 
half-filled lattice and various values of on-site repulsion U, using IQQ (top) and TQQT (bottom) 
conditions. 



5.3. THE ONE-DIMENSIONAL HUBBARD MODEL 133 

of U strange oscillations appear in the IQG results. So in this limit not only the energy, 
but the entire physical content of the IQG-2DM cannot be trusted. This is once again an 
indication that the IQQ conditions fail to describe the strong-correlation limit away from 
half-filling. The IQQT results compare well, both in shape and magnitude, with the results 
from Quantum Monte Carlo |100j . and the Bethe-ansatz results in the strong-correlation limit 

ESI. 



Spin correlation The two-particle spin-correlation function defined as: 

S{r) = {SlSi'^''') = ^<y(T' {a]^aj„a\^^.^,aj+r;a') , (5.41) 

can be expressed as a function of the Q matrix: 

Sir) = 2Z^^'^(^)i'^J<^;(i+'')'^'(i+^)f^' • (5-42) 

Written in terms of the spin-coupled G matrix, only the triplet 5=1 part contributes: 

s(-) = ^^(r)].;o+.)o+o • (5-43) 

Fourier transforming S{r) yields the momentum dependent spin-correlation function: 

S{k) = Y, e^'^^Sir) = ^ E E ^5./.... • (5-44) 



We have plotted this object in Figs. 5.20 5.21 and 5.22 for A, j^ and half filling respectively. 



using both XQQ as XQQT conditions. Unsurprisingly, a good agreement is observed between 
the XQQ and XQQT results for small values of U . At larger values of C/, away from half 
filling, results are very poor with the XQQ conditions, especially in the jQ-filled case, where the 
correlation function is wildly oscillating. More surprising, is that the spin-correlation function 
for the half-filled lattice in the large-C/ limit is also incorrect in the XQQ approximation. In 
the strong-correlation limit, for half-filling, the spin part of the Hubbard model is identical 
to the Heisenberg model [95], for which the spin-correlation function has a singularity at two 
times the Fermi momentum 2kp = vr, which is exactly what we see in the XQQT results. 
Below half- filling the singularity in the large- [/ limit splits and shifts to smaller values of A:, as 



obserbed in the XQQT figures 5.20 and 5.21 This is in agreement with the results in f95j and 
[lOOj . In conclusion we can say that the 2DM obtained with the XQQT conditions, correctly 
describes the physics that governs the spin-correlation function, whereas the XQQ conditions 
do not. It is also important to note, that even though the XQQ results for the energy are 
good for the half-filled lattice, the 2DM is flawed, because the spin-correlation function is not 
correctly described. 

5.3.2 Failure in the strong-correlation limit 

The standard two-index conditions, XQQ, have a problem describing the strong-correlation 
limit, U — )• oo, of the Hubbard model. The three-index conditions, however, describe the 
physics of the model correctly. In this Section we analyse why this is the case, and derive 
some non-standard constraints that try to fix this behaviour at the two-index level. 
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Figure 5.20: Two-particle spin correlation function S{k), as a function of momentum, for a j^ filled 
lattice and various values of on-site repulsion [/, using IQQ (top) and TQQT (bottom) conditions. 
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Figure 5.21: Two-particle spin correlation function S{k), as a function of momentum, for a -^ filled 
lattice and various values of on-site repulsion [/, using IQQ (top) and TQQT (bottom) conditions. 
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Figure 5.22: Two-particle spin correlation function S'(fc), as a function of momentum, for a 
half-filled lattice and various values of on-site repulsion U, using IQQ (top) and TQQT (bottom) 
conditions. 
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The strong-correlation limit of the Hubbard model has a very clear physical structure 
[95| l96] . The spin part of the system, described by the Heisenberg model, is decoupled from 
the charge part of the system, which is described by spinless particles hopping around in a 
lattice. The statistics of these particles is, however, much more complex than the original 
spin-^ fermions, because the blocking of double occupation only occurs in site space. There 
is no reason why the momentum states couldn't have an occupation larger than one. The 
statistics of these particles, called hard-core fermions, has been studied in some detail and is 
called orthostatistics pJUl ITUSl 111)3] . 

When analysing the structure of the IQQ-2DM, it is found that there is no double occu- 
pation of lattice sites, i.e. on has correctly 

4;,fc ^ . (5.45) 

Therefore the error in the energy results from a huge overestimation of the hopping energy 
of the spinless particles, due to the faulty IQQ description of the hopping of particles in a 
singly-occupied lattice. Based on this analysis we next derive two constraints that amend the 
pathological behaviour oilQG in the strong-correlation limit. 

5.3.2.1 The non-linear hopping constraint 

The first constraint imposes a bound on the hopping energy through knowledge of the exact 
result Tj^jj in the C/ — ;■ oo limit and the doubly-occupied fraction of particles on the lattice. 
We can divide the full A^-particle Hilbert space into orthogonal subspaces characterized 
by a fixed number n of doubly occupied sites, i.e. the eigenspaces of: 



n 



Yj 4t°4"4«it • (5-46) 



An arbitrary wave function can now be decomposed as: 

|^) = ^|$H), (5.47) 

n 

in which the j^'")) are not necessarily normalized. The expectation value of the number of 
doubly occupied sites is given by: 

(n) = ^r,t4;.t4 = J]n(^(")|M/(«)) (5.48) 

i n 

>^(^W|$(«)) (5.49) 

=1_(^{0)|^(0)^ _ (5.50) 

From this we can derive a lower bound for the occupation of singly-occupied space, i.e. 

(^(0)|^(0)^ > l_f^ ^ (551) 

in which we have introduced the pair trace symbol: 

Tp = ^ Ti^H-i^H . (5.52) 
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Now take an arbitrary wave function and decompose it into its singly-occupied component 
and the remainder: 

|^) = a|^n+/3|^^) , (5.53) 



in which 



a\^' 



|^(o)\ . 



(5.54) 



The expectation value of the hopping term T in an arbitrary wave function can then be 
expressed as: 



Both the lowest eigenvalue T^jn of the hopping operator on the full space (see Section 



(5.55) 
and 



4.3 



in Fig. 4.2), as well as the lowest eigenvalue T^^^ on singly-occupied space (see Eq. (5.36)) 



are known. Being the lowest eigenvalues the following inequalities hold: 
r°i„ < (^^|f|^^) , and r^„ < (^^°^|f|^^°'^) 



(5.56) 



Eq. (5.55) can now be rewritten as an inequality: 



Tr rr >iaPr°° + 



1 2^0 
I -'min 

\2rri0 I ol^ll/PlT^O 



2|a 



rpU 



+ 2a BT^ 



/(/3). 



(5.57) 



The second line in Eq. (5.57) follows from the fact that the lowest eigenvalue T^jn is always 
negative. Eq. (5.49) implies that |/3p < Tp, so the inequality (5.57) still holds upon replacing 



by Tp, as long as / is a decreasing function. It follows that (5.49) can be formulated as 



a non-linear inequality constraint which can be expressed solely in terms of the 2DM: 



Tr rr > (1 - rp)r-„ + r^^, rp + 2, rp i - Tp 



/ Tp 



(5.58) 



This constraint is necessary as long as / [Tpj > T^^^^, i.e. when 



rp< 



l + c 



with 



min min 

2ro. 

min 



In Fig. 



5.23 



(5.59) 



an example is given for the inequality curve / I Fp I for a 6-site lattice with 5 

particles, in which case T^i^ = —7 and T^^^ = —2. Note that when Fp — )■ the correct 
strong-interaction limit is restored. For larger values the curve quickly descends to the point 
where it crosses T^^^, after which the constraint becomes redundant. 

We have implemented this constraint in the dual-only potential reduction program dis- 



cussed in Section 3.4.2, as it is the most straightforward algorithm to which a non-linear 



constraint can be added. This potential in Eq. (3.46) (here expressed as a function of the 
2DM F) simply acquires an extra term: 



^(F) = Tr Fif (2) _ t In det Z(T)-t In [Tr FT - /* (F)] 



(5.60) 
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Figure 5.23: The inequality curve / [Tp] for a 6-site lattice with 5 particles 



where 



r(r) = <^ 



(i-rp)T-, + TOi Jrp + 2Jrp(i-rpj for Tp<j 



l+c 



2Tr r T^O 



(5.61) 



for Tp > J 



+c 



TVCAf-l) min 

The algorithm's workhorse is Newton's method, for which we need the gradient and Hessian 



of the potential in Eq. (5.60). The gradient can be written in matrix form as in Eq. (3.51), 



with the addition of an extra term coming from the non- linear constraint: 



Vcp = pTriH'~^'>-tY,J^][Cj{rr' 



[Tr FT - f*{T)] 

where the matrix Ip defined in such a way that: 

Tr rip = Tp . 
The non-linear gradient function g reads: 



[T-g{T)lp] 



5(r) = 



Tl„-r^„+ , ^^'" . (i-2rp) for rp<^ 



nun min 



rp(^i-rpj 




(5.62) 



(5.63) 



(5.64) 



for Tp > 



l+c 



As in Eq. (3.55), the Hessian can be seen as a map from traceless two-particle matrix space 



on itself. Compared to Eq. (3.55) there are two extra terms coming from the non- linear 
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L = 6 


iV = 5 






L = 10 


iV = 9 




u 


IQG 


IQQT 


non-lin 


exact 


TQQ 


IQGT 


non-lin 


exact 


50 


-3.55 


-2.29 


-3.06 


-2.20 


-4.89 


-2.54 


-4.64 


-2.46 


100 


-3.49 


-2.15 


-2.51 


-2.08 


-4.77 


-2.27 


-3.66 


-2.22 


1000 


-3.44 


-2.03 


-2.05 


-2.01 


-4.67 


-2.03 


-2.15 


-2.02 



Table 5.11: The ground-state energy of a 6-site lattice with 5 particles (left), and a 10-sitc lattice 
with 9 particles (right), for U — 50, 100 and 1000, exact results compared with v2DM results using 
IQQ and IQGT results, and IQG results with the non-linear constraint. 



constraint. The action of the Hessian map on a traceless two-particle matrix A is given by: 

'TrrA-5(r)Sp' 



HA =iPTr 



j;4(£fc(r)-iA(A)A(r)-i) + 



[Tr rr-/*(r)]^ 



[T-gir)lp] 



+ 



h{r)Ap 



Tr TV - f*(T) 
with the non-linear Hessian function h defined as: 



tp 



(5.65) 



h{T) 



mill 



2 Tp i-r 



0) 



jyl-2rp] + 



mm 



Fp i-r, 



for rp<^ 



(5.66) 



for rp>^ 

After the Newton step A has been determined, we perform a search in the direction of A to 



find the optimal stepsize a. In Section 3.4.2 we showed that this can be done rapidly using 



a bisection algorithm to determine the roots of the scalar function in Eq. (3.65). An extra 



term is added to this function through the inclusion of the non-linear constraint, leading to 
the new line-search function: 



V„0(a) = TrA//(2)_i^K^ 



xf' 



df 



1 + aXf' 



where the derivative of /* with respect to a is: 



5r 

da 



(r + aA) = (T^in - T^J Ap + 



Tr TA - ^ (r + oA) 
Tt {r + aA)T-f*{T + aA) 



-'min^P 



(5.67) 



Tp + aAp^ (l-Tp- aAp^ 



1 - 2 ( Tp - Ap 

(5.68) 



The results of such a calculation, on a 6-site lattice with 5 particles and on a 10-site 
lattice with 9 particles for large values of U are shown in Table [STTT} the non-linear hopping 
constraint results are compared to IQQ, XQQT and exact results. For the 6-site lattice 
the exact results were calculated using diagonalization, for the 10-site lattice this no longer 
possible and an MPS algorithm was used [HJ El E21 [S3]- It can be seen that the non-linear 
constraint is succesful as the strong-correlation limit is correctly recovered. It is, however, 
also observed that the constraint only becomes active at very large values of C/ > 30, and 
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as such does not improve the quality of the IQG energy curves for the U range displayed in 



Fig. 5.13 Calculations performed at larger lattice sizes indicate that the value of U where the 
constraint becomes active increases with lattice size, which renders it even less valuable as a 
practical tool. In conclusion, we can say that the non-linear hopping constraint guarantees 
that the energy converges to the right value in the large- [/ limit, which is what we set out to 
achieve. But it fails to improve the quality of the energy and 2DM at intermediate values of 
U. 



5.3.2.2 The Gutzwiller projection constraint 

In the previous Section we constructed a new constraint by looking at the symptom i.e. the 
energy in the strong-correlation limit is far below what it should be. We imposed an exact 
lower bound to the energy for a certain double occupation Tp. In this Section we derive a 
constraint that fixes the origin of the problem, being the faulty description of spinless particles 
hopping on a lattice. The creation and annihilation of particles on a singly-occupied lattice 
can be described by the so-called Gutzwiller projection operators [104^1105] : 



go 



9c 



1 



O-a^o: 



1 



(^a^ct 



(5.69) 
(5.70) 



where a and a are single-particle indices on the same site with opposite spin. One can see in 



Eq. (5.70) that a particle in the state a can only by created if there is no particle present in 



the opposite spin state a; if there is, the Gutzwiller operators annihilate the state. In analogy 
with the necessary and sufficient conditions on the IDM derived in Section 2.2.1, we now 



propose that every Hamiltonian that is expressed as a first-order operator in the Gutzwiller 
creation and annihilation operators will be correctly optimized if the following conditions are 
fulfilled: 

(5.71) 

(5.72) 



P^ hO 


with 


Pap-- 


- (^^bi^/^i^^ 


q^ hO 


with 


q%-- 


= (^^ISa^l^^ 



It is clear from Eqs. (5.69) and (5.70), that these constraints are third-order operators, in 



the underlying fermion creation/annihilation operators. This explains why three-index con- 
straints are needed to describe the strong-correlation limit, and why the two- index conditions 



fail. By imposing the Gutzwiller conditions (5.71) and (5.72) we introduce the relevant corre^ 



lations present when \J increases, without having to resort to the full three-index conditions. 
The problem is that the Gutzwiller conditions cannot be expressed as a function of the 2DM, 
so if we want to stick to a 2DM framework, we have to manipulate the conditions somehow. 
Using the same approach as for the T conditions, we can anticommute the two Gutzwiller 
conditions to reduce the rank of the operators by one: 



G^O 



with 



G 



al3 



{91,913} 



:i 



OiQi^Ct) 



(5.73) 



This is obviously a trivial constraint, in fact it is already included in the XQQ conditions. If 
we expand the Gutzwiller conditions we can express them as a function of first-, second- and 
third-order operators: 

j J- aa;/3a ^ \^ 



PafS 
QafS 



Pal3 



lap 



r. 



|a^a^aaa^a^a/3|^ 



G(X)l 



QiX)^ 



+ (^ 



N\J 



J J . 



aka.nCinOj'aa'j,ao\^ 



N\ 



(5.74) 
(5.75) 
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One can reduce the order of these constraints by only anticommutating the third-order part, 
which leads to the following two matrix functions: 

P (r)a/3 = Pal3 -'^aP;l3$-'^aa;l3a + T2{T)^^^.p^^ , (5.76) 

Q^(X)a(S = Qap-G{T)^^.^p-g{r)s,a;al3 + T2{T)^s^^.00 . (5.77) 

These have to be positive-semidefinite, since a positive matrix was added to the original 
Gutzwiller conditions. Expressing Ti as a function of the 2DM one can rewrite: 

P (r)a^ = Pal3 - ^a^;l3^ " ^aa;l3a + Sa/^Paa , (5.78) 



(2Tr r \ 



(5.79) 



These matrix positivity conditions are easily included in any of the algorithms described in 
Chapter [3j and at a very small computational cost as the dimension of the matrices is the same 
as that of the IDM. Taking translational invariance into account, these constraints even reduce 
to L linear constraints, which can straightforwardly be included in the SDP algorithms from 
Chapter [3| The new constraints were implemented and added to the translationally invariant, 
parity-symmetric primal-dual program. Calculations were again performed for lattice sizes 
of 20 and 50, with Jj and j^ filling, the results of which are shown in Fig. 5.24 It is seen 
that the constraints clearly include something that is missing in the IQQ conditions, because 
they become active at reasonably small values of C/ > 5. The amount of improvement when 
including the Gutzwiller conditions, however, is quite disappointing. Moreover, calculations 
at higher values of U indicate that the [/ — >• cxd limit is not restored by adding these conditions. 
The reason for this is probably the addition of the anticommutator term to the third-order 



terms in Eqs. (5.74) and (5.75). In conclusion we can say that we have identified the origin 
of the failure of the IQQ conditions, and derived constraints on a subspace of three-particle 
space that have to be included to fix the limit. If we manipulate these constraints so that 
they can be expressed as a function of the 2DM, the resulting conditions improve the results, 
but are not strong enough to fix the strong-correlation limit. The only way out is to introduce 
an object beyond the 2DM, which is the subject of the next Chapter. 
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Figure 5.24: Ground-state energy per particle as a function of on-site repulsion U of the Hubbard 
model, for a 20-site (top) and 50-site (bottom) lattice at jq and ^ filling. A comparison is made 
between an optimization using only TQQ and one using TQQ and the Gutzwiller conditions. 



CHAPTER 6 



Restoring the strong-correlation limit in the Hubbard model 



Previously ( Section |5 .31 ) we discussed the v2DM study of the one-dimensional Hubbard model. 
We found that the two-index conditions fail to describe the strong-correlation limit, (i.e. 
strong on-site repulsion or [/ — )• c«), and that three-index conditions 7i, T2 are needed to 
incorporate the physics in this limit. We identified the origin of the two-index failure and tried 
to fix it, cheaply, by applying constraints on a subspace of three-particle space, the so-called 
Gutzwiller conditions. This was unsuccesful because we approximated these constraints in 
order to express them as a function of the 2DM. In this Chapter we go beyond the 2DM 
and identify the minimal object from which both the genuine Gutzwiller conditions (Eqs. 



(5.74) and (5.75)), and the two-index conditions can be simultaneously derived. We establish 
that the strong-correlation limit is restored, without resorting to the computationally heavy 
three- index constraints. 



6.1 An intermediary object: the 2.5DM 

The most compact object from which both the Gutzwiller conditions and the two-index 
conditions can be derived is the 3DM with one index diagonal, which we call the 2.5DM: 



^^PmS = Y.w^{^f\aiala{a^asa^\^f) . (6.1) 



Just as the 2DM, the 2.5DM can be used as the basic variable in a variational scheme. 
By permutation of the indices one can derive 6 conditions on this object, called the lifting 



conditions [20l|22l|26]. Instead of using Eq. (6.1) we preferred to use the slightly larger object: 



i 

where the third index is diagonal in the site index only, and not in the spin index. This 
is a more desirable object since it captures more correlations. In addition we found that 
the extra flexibility makes a significant difference in the final energy prediction. It is also 
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s 


Sab 


sp 


ordering 


dim 


deg 


1 

2 
1 
2 
3 
2 




1 
1 




a<b 
a <b 
a < b 


L(L+1) 

2 
L(L-l) 

2 
L(L-l) 

2 


2 
2 

4 



Table 6.1: The dimensions, degeneracies and orbital ordering of the S — I and 5 = | blocks 



appearing in the 2.5DM object defined in Eq. (6.3) 



advantageous compared to the 2.5DM in Eq. (6.1) because a spin-coupled version can be 
constructed, defined as: 



W 



llS{Sab',Scd) 

\ab\cd 



E-^T^E(^^-M.i^^ 



[S] 



B 



s 



ab{Sab)l ^cd{S^i)l\^SM,il ' 



^ 



N 



(6.3) 



M 



with B^ the three-particle creation operator as defined in Eq. (4.43) 

Sab 



^ ab{Sab)c 



1 



VW) 



al®al 



,t 



(6.4) 



In the same manner as for the spin discussion in Chapter |4j a spin-averaged ensemble is used 
to decompose the 2.5DM into blocks with different three-particle spin S. Because of this, the 



spin-coupled 2.5DM can be optimized faster than the spin-uncoupled 2.5DM in Eq. (6.1). The 



spin-coupled 2.5DM in Eq. (6.3) can only be written as a function of the 2.5DM in Eq. (6.2), 



since off-diagonal spin terms are needed to produce the correct coupling, as can be seen from 



expanding Eq. (6.3): 



W 



l\S{Sab\Scd) 

\ab\cd 



ypIM.t 



E E {l^^a^'^b\SabMab){SabMab-(Tl\SM) 



(TbaiMab 



E E {\^c\od\S,dM,d){ScdMj-a[\SM) W^ 



O-cO-d M^ciC7[ 






(6.5) 



In practice we store L blockmatrices, one for each site. Every blockmatrix consist of two 
blocks, one with spin- 2 and one with spin-|, with dimensions and degeneracies given in 



Table 6.1 There is another complication that arises when using the 2.5DM, which is that 
because of the antisymmetry between some single-particle indices in the 2.5DM, relations exist 
between elements in different blocks. These relations have to be imposed as extra consistency 
conditions. 

6.1.1 The consistency conditions 

To get insight into the problem, we introduce the consistency constraints for the spin-diagonal 



2.5DM in Eq. (6.1). There, 2L blocks are stored, all of dimension (2L — 1)(L — 1), with a < /3 

(6.6) 



and both a and /3 different from A. When /? = (5 in Eq. (6.1) it follows that: 



w: 



Wt. 



When j3 = 5 and a = 7 there are two equalities to take into account: 



W, 



A 

al3;al3 



W: 



p 



ciX\aX 



W' 



XP]XI3 ■ 



(6.7) 
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These equalities have to be included in the SDP program using the method of hnear equality 
constraints explained in Section [3.4.2.4 We have to find an operator that projects on the 
right subspace (see Eq. (3.86)). In the present case the projection operator is very simple: 
for the case of one equal index pair, 



^a/3;7/3 



1 r 






and for the case of two equal index pairs, 



Wt 



w: 



+ w. 



/3 



a\:a\ 



+ w%. 



A/3;A^ 



(6. 



(6.9) 



For the spin-coupled 2.5DM in Eq. (6.3) the consistency conditions become a lot more 



complicated, for two reasons. First, the first and/or second lower index can be equal to the 
upper index defining the block, which leads to relations between elements in the same block. 
Second, exchanging upper and lower indices is not straightforward, but involves recoupling 



of the intermediate spin, as was the case for the 7i conditions (See Eq. (4.46)). This merely 
leads to a lot of bookkeeping problems. In what follows we list one example for every class of 
possible symmetries. 



6.1.1.1 Case of one equality 

When there is one equality between the indices, two different types of projection appear: 



a = 1 : when one lower index is equal to the block index /; the W matrix must satisfy the 
identities: 



i|S(S„,;5,,) _ j^^^^ Y}.Sn\ <! f ? 5' !> W^'l^(^"'-'') 



^ \lh;cd ""■ = \.^ah\2_^\.aih\\ 1 1 ^ 



ah 



'lb;cd 



and the corresponding projection operator reads: 



W 



lb\cd 



w 



'fl^"''-'^ + [Sab]Y.\Slb\\^l ? f'^^M^'l^^^--^) 



Sib 



lie 
2 2 '^'^^ 



Hbicd 



(6.10) 



(6.11) 



a = c : when two lower indices are equal, consistency requires that: 
W 



i,5(5„,;5.,) _ ^s^^^^Scd] Y. iSlb][Sl,] <! f ? 5'Uf ? 5' !> t^'^l^^^'"'") 



'ab;ad 

SibSid 

which leads to the projection: 
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cd 



Hbdd 
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+ [Sab] [Sc(l\ 2_^ [Sib] [Sld\ 
SibSid 

^ o ^Ib II'-' 9 ^Id 



1 1 

2 2 



S 



ab 



1 ? r.^^i/wd 

2 2*^' 



'cd 



a\S{Sib;id) 



(6.12) 



(6.13) 



6.1.1.2 Case of two equalities 

Three diff'erent cases appear, where two equalities exist between the indices: 
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a = c and c = 1 : Now there are three symmetry relationships the elements have to satisfy: 
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The corresponding projection reads: 
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+ (6.14)^ + (6.15)^ + (6.16) 



where the notation (6.14)^ is short for the right-hand side of Eq. (6.14) 



(6.14) 
(6.15) 
(6.16) 



(6.17) 



a = c and b = 1 : two different relationships have to be satisfied: 
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which are imposed using the following projection: 
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a = c and b = d : for the last type two equalities have to be satisfied: 
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which can be imposed using the projection: 
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6.1.1.3 Case of three equalities 

There are two different instances where three equalities hold between the indices. 
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a = b, b = c and d = 1 : For the first type, consistency requires that the following three 
conditions that are satisfied: 
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leading to the following projection: 
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a = c, b = d and a = 1 : For the final type, there are four conditions that have to be 
fulfilled: 



W' 



l\S{Sab\Scd) 

\lbdb 



[^ab] 2_JiSlb\ \ 1 
Sib 



S K Sih 



2 2 "-"afe 



w 



l\S{Si},;Scd) 
\lb;lb 



W' 



l\S(Sal,',Scd) 

\lb:lb 



[Scd] y jsu 



Sid 



S 2 "^Id I TTrl\S{Sab;Sld) 



ld\\ 1 1 



lie 
2 2 "-^cd 



W 



Ibdb 



w 



'^'^'^''''''^ =\Sab\[Scd]Y.^^l^^^SlAl I -'"^^^ ^ Su\^l,SiS,b-Ad) 



\lbdb 



SibSid 



2 2 



S, 



ab 



Sr. 



w 



l\S{Sab',Scd) 

\lb;lb 



=2[5,,][5erf](-l)^-+^- Y, [Sai][Sci]{-lf^' 



11-, 

2 2 '-'cd 

'Scl 



\lbdb 



SalS, 



cl 



Q 1 Q^fcl C 
'-' 9 *-'ai I I 'J 9 '-'( 



2 2 



'-'ab J I 9 9 ^cd . 



\lldl 



This is imposed by the following projection: 
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(6.32) 



Symmetry relations between elements in the same site block / causes some of the elements 
to be linear dependent, implying that a W matrix which satisfies the consistency conditions 
has zero eigenvalues. This can be circumvented by taking the pseudo- inverse of the W matrix 



(see also Section 5.1.1.1), which excludes the eigenvectors with zero eigenvalues from the 
inversion process. 



6.2 The spin-adapted lifting conditions 

There are six independent matrix positivity conditions that can be expressed as a function of 
the 2.5DM. They are derived in the standard way, by finding manifestly positive Hamiltonians: 



H = Y^I^^ 



(6.33) 
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that can be written as a function of the 2.5DM using anticoniniutation relations and spin 
recouphng. 

The Xi condition The Xi condition trivially expresses the positivity of each individual site 
block of the 2.5DM, analogous to the I condition for the 2DM: 
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(6.34) 



The positivity of this object is clear from its definition in Eq. (6.3). Bear in mind that every 



site block is composed out of two spin blocks, both of which have to be positive semidefinite. 



The Q2 condition: the first non-trivial positivity constraint is the Q2 condition, which is 
defined as: 
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(6.35) 



with B^ given by Eq. (6.4). This is the equivalent of the Q condition for the 2DM, and it 



is readily seen that every site block has to be positive semidefinite. Using anticommutation 
relations and some straightforward angular momentum recoupling algebra we can express Q2 
as a function of the 2.5DM: 
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This is quite similar to the 71 condition with one site-index pair diagonal. The difference lies 
in the fact that the third-order term does not disappear (as in 7i, due to the anticommutator) . 
But the third-order term is in fact the 2.5DM matrix, so we have that: 



n- /r\^(^abjScd) -T /Jjr\l\S{Sab]Scd) , /-) /Jjr\l\S(Sab'<Scd) 
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(6.37) 



6.2. THE SPIN-ADAPTED LIFTING CONDITIONS 



151 



The unit matrix on 2.5DM space used in the Q2 niap (first term on the right in Eq. (6.36)) 



is not a diagonal matrix, as a result of the consistency conditions inside a site block, which 
introduces terms with off-diagonal intermediate spin: 
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In the Q2 map of Eq. (6.36), the 2DM appears; it can be derived from the 2.5DM by tracing 
over the block index: 
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The X2 condition: the X2 condition has no equivalent in the v2DM formalism. It is defined 
in a spin-averaged ensemble as: 
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with B^ a two-particle-one- hole operator as defined in Eq. (4.50): 
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Again, every site block of the matrix has to be positive semidefinite. Using anticommutation 
relations and spin recoupling we can express the X2 condition as a function of the 2.5DM: 
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The Qi condition: the Qi condition is of the same type as the X2 condition, and reads: 

(6.42) 
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condition it can form the 72 condition diagonal in the third site-index: 



with B\ the two-particle-one-hole operator as in Eq. (6.41). In combination with the X2 

(6.43) 
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Retracing familiar steps, one obtains an expression of the Qi condition as a function of the 
2.5DM: 
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The Qi condition: the remaining two conditions are of a type we haven't encountered 
before. The Qi condition is defined through a spin-averaged ensemble as: 

(^i)'l2S^'"^ = E^^tJp T.i^SM,\B^abiiSu) B'^iisJ^SM,) ' (6-45) 

i '■ ' M 

where B^ is a one-hole-two-particle operator, i.e.: 

,g r I fl '-' 

= E E (-l)^"''"(^ - ^a55,M,,|5M)(^cT,^a,|5MM,,)a,.„a^aJ^^ . (6.46) 

Notice the difference with the regular three-particle spin coupling. First the second and third 
index are coupled to intermediate spin 5';,^, after which the first index is coupled with Su 
to total spin S. Using anticommutation relations and spin-recoupling algebra we obtain the 
following expression of the Qi map as a function of the 2.5DM: 
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The t/2 condition: the Q2 condition is of the same type as Qi, and defined through a 
spin-averaged ensemble as: 
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where this time the B' is a one-particle-two-hole operator, given by: 
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Anticommuting the creation and annihilation operators, and spin-recoupling the emergent 
terms, leads to the following for the Q2 condition as a function of the 2.5DM: 
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6.3 Formulation as a semidefinite program 

We can reformulate the whole optimization problem using the 2.5DM as the central variable. 
For some Hamiltonian H, we optimize the matrix W under the constraint that is has the 
correct particle number, fulfills the consistency conditions, and has positive semidefinite linear 
matrix maps, as discussed in the previous Section: 
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The 2.5-Hamiltonian introduced above is defined by the relation: 

Tr WH^^-^^ = Tr TH^^'> . 



(6.52) 
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This optimization problem can again be formulated as a semidefinite program, by expanding 
W in a complete orthogonal basis {/^} of traceless 2.5DM-space: 
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in which the /p^/'s satisfy the consistency conditions. We can now define the v2.5DM problem 
as a dual-form semidefinite program with the following structure matrices: 
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The whole formalism developed in Chapter |3] can now be taken over, provided we find correct 



Hermitian adjoint maps. These are derived in Section C.2 



6.3.1 The overlap matrix 

For the primal-dual interior point method and the boundary point method in Chapter [3] we 
introduced the overlap matrix (See Section 3.4.3.2). For the translation of this algorithm to 
the v2.5DM formalism, we need to introduce a new overlap matrix: 
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Analogous to the 2DM case, the overlap matrix can be interpreted as a map from 2.5DM 
space on itself: 



SA = P^ 
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in which we sum over all the conditions introduced in the previous Section, and with Pw a 



projection on fully consistent traceless 2.5DM space, as explained in Section 6.1.1 Unfortu- 
nately this overlap matrix can not be easily inverted, as was the case for the 2DM overlap 
matrix, because the lifting conditions are not invariant under unitary transformations on 
single-particle space. Instead of inverting it analytically, we use the linear conjugate gradient 
method to compute the action of the inverse overlap matrix on a 2.5DM. This slows down 
the program, but not dramatically, as the number of iterations needed for the conjugate gra- 
dient loop to converge is small, and remains constant during the program. What is more, the 
computational cost of one overlap matrix- vector product scales as M^, whereas the heaviest 
computations in the algorithm scale as M'^ . This should be compared to the M^ scaling of 
the full three-index conditions. 



6.4 Results 

The formalism introduced above has been implemented for the boundary point algorithm 



(See Section 3.5). The results of such a v2.5DM calculation are shown in Fig. 6.1 where 



the ground-state energy of the one-dimensional Hubbard model is plotted as a function of 
the on-site repulsion [/, for a 6-site lattice with 5 particles, and for a 10-site lattice with 
9 particles. The v2.5DM results are compared to the v2DM results using XQQ and XQQT 
conditions, and to exact results. For the 6-site lattice exact diagonalization was used, for 



6.4. RESULTS 



155 




10 



12 



14 



16 



U/t 



Figure 6.1: Ground-state energy as a function of on-site repulsion U of the Hubbard model, for a 
6-site with 5 particles (top) and for a 10-site lattice with 9 particles (bottom). A comparison is made 
between an optimization using IQQ conditions, TQQT conditions, v2.5DM calculations and exact 
results. 
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L = 6 


iV = 5 






L = 10 


iV = 9 




u 


IQG 


IQGT 


V2.5DM 


exact 


IQG 


IQGT 


V2.5DM 


exact 


50 


-3.55 


-2.29 


-2.28 


-2.20 


-4.89 


-2.54 


-2.53 


-2.46 


100 


-3.49 


-2.15 


-2.14 


-2.08 


-4.77 


-2.27 


-2.26 


-2.22 


1000 


-3.44 


-2.03 


-2.01 


-2.01 


-3.44 


-2.03 


-2.01 


-2.01 



Table 6.2: The ground-state energy of a 10-site lattice with 9 particles (right) and 6-sitc lattice 
with five particles (left) for U = 50, 100 and 1000, exact results compared with v2DM results using 
IQG and IQGT results, and v2.5DM results. 



the 10-site lattice this is no longer computationally feasible, but we can again compare to 
quasi-exact results calculated with an MPS optimization [5l El |92l |93]. In this Figure, one 
can see that the v2.5DM results are always of IQGT quality without resorting to the full 
TQGT framework. In fact, the v2.5DM results are slightly better than those obtained with 
TQGT- This is because the 71 and Ti conditions express the positivity of an anticommutator 



of three-particle operators (see Section 2.2.2.2), whereas in v2.5DM positivity is imposed on 
all possible individual products of three-particle operators, be it of a restricted class. In 



Table 6.2 one can see that for very large values of U, the exact limit is restored, which we 
expect from the discussion about the Gutzwiller conditions in the previous Chapter. 

Using these spin-adapted lifting conditions in the v2.5DM framework, we were able not 
only to fix the strong correlation limit, but also improve the quality of the results at inter- 
mediary values of U to IQGT quality. The computational cost, however, is two orders of 
magnitude smaller than when enforcing the full IQGT conditions. It must be stressed that 
up to now we have only included the spin symmetry of the model in our code. If translational 
invariance, parity and pseudospin symmetry are taken into account much larger lattices can 
be considered. As an example, our fully symmetric IQG version allows lattice sizes up to 100 
sites, and the fully symmetric IQGT program up to 20 sites. We expect a fully symmetric 
version of v2.5DM to be applicable to lattice sizes of about 50 sites, thereby enabling us to 
study two-dimensional lattices of reasonable size. 

The diagonality of the third index in the 2.5DM implies that the result will depend on 
the chosen single-particle basis. For the Hubbard model it is clear that the site basis is the 
optimal basis to use for the diagonal third index. It would be interesting to study other 
systems where it is less clear what the best choice of the single-particle basis would be. An 
appealing application, e.g., are molecules, where one can hope to get three- index precision by 
applying the v2.5DM method with a carefully chosen basis. A first guess of what the best 
basis would be is the basis of natural orbitals, for which it has been shown that the fuU-CI 
expansion has the fastest convergence [9j. 



CHAPTER 7 



Conclusions and outlook 



In this work we have provided an overview of a quantum many-body technique in which the 
two-particle density matrix (2DM) is determined variationally, and thereby replaces the wave 
function as the central object. It is our hope that we have convinced the reader that this is 
a promising technique, appealing in the simplicity of its underlying idea, and complementary 
to other many-body methods. At the same time we have tried to indicate where the problems 
and difficulties lie, and shown that the method has a long way to go before it can be used as 
a 'black box' electronic structure method. 

In Chapter [2] we discussed the iV-representability problem, which consists of finding the 
necessary and sufficient conditions a 2DM has to fulfil to be derivable from a physical wave 
function. Although the general problem is nearly impossible to solve, one can derive necessary 
conditions using the dual definition of A^-representability. First we showed how the standard 
two- and three-index conditions can be derived using manifestly positive Hamiltonians. These 
conditions can be formulated as matrix positivity constraints, and are the ones commonly used 
in literature. The second part of the Chapter deals with constraints that are non-standard, 
i.e. the constraint Hamiltonian is not manifestly positive, but a lower bound can be found 
in some different, computationally cheap, way. The formalism introduced in Chapter [2] is 
very flexible, which makes it easy to derive new necessary constraints. It is hard, however, to 
derive constraints that are active (in the sense that the inequality is violated after applying 
the standard conditions), and improve the result for relevant physical systems. 

The constrained optimization problem (v2DM) introduced in Chapter^ can be translated 
to a standard numerical optimization technique called semidefinite programming. In Chap- 
ter^ we start by formulating variational density matrix optimization in the two forms (primal 
and dual) in which a semidefinite program can be formulated. It is seen that in v2DM the 
dual form is the most efficient formulation, and three different algorithms are adapted to the 
specific form of v2DM. Two of these are so-called interior point methods, which stay inside the 
feasible region during optimization. The third one is a boundary point method, which stays 
on the hyperplane where the complementary slackness conditions holds, and moves towards 
A'"-representability. At the end of the Chapter, a comparative study is made between the 
different algorithms. The drawbacks and advantages of the different methods are listed, and 
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their performance for the specific case of the one-dimensional Hubbard model at half filling 
is compared. The boundary point method is seen to be the most suited for this highly sym- 
metrical type of problem. Although a lot of progress has been made, the different algorithms 
all remain far slower than electronic structure methods with a comparable accuracy {e.g. 
coupled-cluster with single and double excitations), and this remains the biggest drawback to 
the method at this point. 

In Chapter[4]we show how the efficiency of the v2DM method can be increased by including 
symmetries specific to the problem under study. We start by explaining how to introduce 
the most commonly available symmetry in electronic structure problems, i.e. spin symmetry. 
Then we consider symmetries specific to certain types of systems, such as rotational symmetry 
in atomic systems and translational invariance in the Hubbard model with periodic boundary 
conditions. It is shown that these symmetries can always be included in a straightforward 
matter, although the practical implementation can sometimes be a technical and tedious task. 
The inclusion of symmetry shows one way to overcome the computational complexity of the 
current implementations of semidefinite programming algorithms, up to a point where they 
are more competitive to other methods. It must, however, be stressed that the advantage is 
limited to the study of highly symmetrical systems. 

In Chapter [5} all different aspects of the v2DM method discussed in the preceding Chap- 
ters come together. The first Section discusses the application of the standard two-index 
constraints to the isoelectronic series of Beryllium, Neon and Silicon. For this study full 
use was made of the available symmetry present in atomic systems, which allows the v2DM 
method to be used on far larger basissets than previously possible. It was found that the 
two-index conditions succesfully describe the static electron correlation arising from the near- 
degeneracy of single-particle levels at higher values of the central charge Z. The quality 
of the variationally obtained 2DM was verified through the extraction of different physical 
properties {e.g. ionization energy) and its physical content was found to be reliable. The 
next Section focussed on the potential energy surfaces of diatomic molecules. It was found 
that the standard two, and even three-index conditions fail to describe the dissociation limit. 
To fix this we derived a new type of A^-representability condition which imposes constraints 
on the separate atomic subsystems of molecule. The last Section treats the one-dimensional 
Hubbard model, for which spin symmetry, translational invariance and inversion symmetry 
allow a study for large lattice sizes. In previous v2DM studies only the half-filled model 
was studied, and only the ground-state energy was considered. We show that the two-index 
conditions fail to describe the strong-correlation limit below half filling, and that three-index 
constraints are needed. Not only the ground-state energy, but also the spin and charge cor- 
relation functions are seen to deviate substantially from the expected result. The origin of 
the failure can be traced to a bad description of correlations in a subspace of three-particle 
space, which is responsible for the hopping on singly-occupied lattices. Two new necessary 
constraints on the 2DM are derived based on this analysis. It is found that these constraints, 
while helpful, are too weak to cure the pathological behaviour, and a larger object is needed. 
In Chapter [6] the 2.5DM, which is a 3DM diagonal in one spatial index, is introduced. Six 
standard matrix-positivity conditions can be derived for this object, and a proof-of-principle 
implementation shows that this approach effectively fixes the strong-correlation limit, while 
being two orders of magnitude less computationally complex than the three-index conditions. 
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Outlook 

Although the method discussed in this thesis has been around since the 1960's, it remains an 
obscure, even somewhat mysterious, many-body method which still needs more investigation. 
Fortunately, activity in this area has picked up in recent years. The single most important 
thing for the advancement of the field is the development of an algorithm that scales genuinely, 
and not just nominally, as 0{M^. If we fail to develop a method that is actually competitive 
with other electronic structure methods, progress will be slow and the method will remain 
in the margins. Apart from the scaling issue, there are a number of directions where I think 
progress is possible in the field, or for which the method could actually contribute information 
where other methods fail. 

• It is notoriously difficult for many-body methods to obtain decent results for the ground- 
state properties of the two-dimensional Hubbard model. This model is highly symmet- 
rical, and a v2DM approach could be used to obtain complementary results to those 
obtained by other methods. Results on the one-dimensional Hubbard model indicate 
that, if one exploits all the symmetries in a v2.5DM approach of the two-dimensional 
Hubbard model, decent results for reasonably sized lattices could be obtained. 

• The subsystem constraints that were introduced for diatomic systems, only become 
active for relatively large internuclear distances. An idea to improve on this is to use 
a previous IQG calculation on the diatomic molecule to construct a better subsystem 
Hamiltonian. 

• One thing that, in my view, has not been exploited enough, is the power of exactly 
solvable models as conditions, possibly in combination with the subsystem constraints. 
E.g. one could imagine using the Bethe-ansatz energies for the one-dimensional Hubbard 
model as subsystem constraints in two-dimensional models. 

• The three-index constraints fix a lot of problems, and generally improve the results 
by an order of magnitude. They are however computationally very heavy to impose. 
Because they are expressible as a function of the 2DM alone, a matrix-vector product 
can be performed very efficiently, and one has the feeling this could be exploited to 
impose the constraints in a much more efficient way. 

• At this point, there is no formulation of reduced density matrix optimization in the 
thermodynamic limit. This is because the reduced density matrix becomes ill-defined in 
the thermodynamic limit. A better suited object would be the related density cumulant 
matrix. It is our ambition to reformulate the density matrix optimization in terms of 
the density cumulant, and use it in a study of the electron gas. 
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Mathematical concepts and notation 



Throughout the thesis mathematical concepts and symbols are used which might be unfamiliar 
to the reader. In this appendix some of these conceps are briefly explained. 

Convexity A subset F of a linear vector space is convex if, for every two elements of this 
set, all elements on the line segment connecting these two elements are also elements of the 
set, i.e.: 

yx,yeV z = ax+{l-a)y eV for 0<a<l. (A.l) 

Convexity shows up in the discussion of ensemble A^-representability, where an ensemble 
A^-representable pDM. is defined as: 

^r«i...a,;ft.../j, = X]«^i (^fl4i---<a/3p---a/3il^f) , (A.2) 

i 

in which 

Wi > and N^ Wi = 1 . (A. 3) 



The weighed sum of two A^-representable pDM's is also expressable in the form (A.2), and 
is by definition also a pDM, which means that the set of ensemble A^-representable pDM's is 
convex. 

p-particle matrix space p-matrix space is the linear vector space formed by symmetric 
matrices on p-particle space. We define a scalar product of two p-matrices A and B as: 

^^'^ Qi...Op/3i.../3p 

The trace of a p-matrix is given by: 

Tr ^ = — 2_^ ^«i...ap;ai...ap , (^-5) 

^' ai...ap 
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and the norm of a p-matrix is defined as: 



\\A\\ = VTr AA . (A.6) 

Carrier space Carrier space is the direct sum of the different constraint matrix spaces, e.g. 
the matrices Z and X appearing in the primal and dual formulation of semidefinite programs 
(see Chapter [3]) live in carrier space: 

Z = A(r) and X = ^Xc,. (A.7) 

i i 

The scalar product and trace on this space is defined as the sum of the scalar products and 
traces on the constraint matrix spaces: 

Tr [AB] = ^ Tr [Ac,Bc,] and Tr .4 = ^ Tr Ac, . (A.8) 



Wedge product The wedge product or Grassman product of a p and a q particle matrix 
is the direct product of these two matrices, antisymmetrized in the single-particle indices, so 
it becomes a {p + g)-particle matrix, e.g. for two IDM's we have: 

(P ^ P)al3;-y5 = Pa-yP^S - PaSPp-y ■ (A. 9) 

Chemical and physical notation In this thesis physical notation has been used for the 
2DM. This means that the single-particle indices on the left of the semicolon form the row 
two-particle index of the 2DM, and those on the right the column two-particle index. In 
chemical notation one groups the single-particle indices belonging to the same 'particle': 

r.ft^^ =TJI (A.IO) 

physical notation chemical notation 

Notation In this paragraph we define some symbols and abbreviations that are frequently 
used throughout the text: 

• The X condition on the 2DM, defined in Chapter [2| is also referred to as the P oi D 
condition in literature. 

• M: The dimension of single-particle Hilbert space, or for chemists, the number of spin- 
orbitals. 

• N: The number of particles in the system. 

• L: The size of the lattice in the one-dimensional Hubbard model. 

• sp: single-particle 

• tp: two-particle 

• pDM: p-particle reduced density matrix 

• v2DM: the variational determination of the 2DM 
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v2.5DM: the variational determination of the 2.5DM 



[j] = V2J + T 

J = VjU + 1) 

(ab) = 1 + 6ab — ^ abbreviation of the norm used in the spin-coupled matrix maps. 



APPENDIX 



B 



Angular momentum algebra 



This appendix contains some of the relations from angular momentum algebra that are used 
in Chapters H^ and ^ For more detailed information on this topic we refer to [63] . 



B.l Spin coupling 

The direct product of two spins ji an J2 is coupled to good total spin J using the Clebsch- 
Gordan coefficients: 

\jiJ2;JM)= ^ {jimiJ2m2\JM)\jimi)\J2m2) . (B.l) 

mim2 

The inverse transformation is given by: 

\jimi)\J2m2) = ^{jimiJ2m2\JM)\jiJ2; JM) . (B.2) 

JM 

Because this is a unitary transformation the Clebsch-Gordan coefficients satisfy the following 
orthogonality relations: 

y^ {jimiJ2m2\JM){jimiJ2m2\J' M') = 5jj'6mm' , (B.3) 

mim2 

^{jimiJ2m2\JM){jim[J2m'2\JM) = S^^^/^Sm^m'^ ■ (B.4) 

JM 

The Clesch-Gordan coefficients can be written in a more symmetrical form, called Wigner 
3j-symbols: 

{Jimd2m2\j3ms) = {-ir-^-^"^nh] (^^^ ^^ _^^J, (B.5) 
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with orthgonality relations: 

(B.6) 

(B.7) 



The Wigner-3j symbol is invariant under even permutations of columns, for odd permutations 
a phase (— l)-Ji+-?2+Js j^jg^g iq |-,g added. The same phase arises when the sign of all nii is 
reversed: 

31 32 k \ ^ ^_-^y^+J2+js f h h h \ _ /ggx 

mi 771,2 ?7i3/ y—fni —fn2 —m^J 

When three spins are coupled there are multiple choices we can make for the coupled 
basis, depending on which two spins we couple to some intermediate spin, e.g. 

U1J2J3; (J23) JM) = Y, Y {3imiJ23M23\JM) 

■miM23 rn2m,s, 

{32m2hm3\J23M23)\jimi)\J2m2)\J3m3) , (B.9) 

\jlJ2J3]{Jl2)JM) = X] X] {jimiJ2m2\Jl2Mi2) 

mim2 m-iMi2 

{Ji2Mi2J3m3\JM)\jimi)\J2m2)\J3'm3) ■ (B.IO) 
The unitary transformation that connects these different coupling schemes is given by: 



\3l32h;iJ23)JM) = 5](-l)^'l+^'^+^'^+^[Jl2][J23] 

J12 

31 32 J12 



, T T I \h32m{Ji2)JM) , (B.ll) 
33 J J23 

in which the Wigner-6j symbol appears, which is symmetric under all permutations of the 
columns, as well as invariant under switching the upper and lower row of two columns, leaving 
the third column fixed. We introduce it here because there are some very useful recoupling 
relations that are employed in the coupling of the matrix maps in Chapter |4] and [6| 

2 /ii 32 3 1 /ji 32 3 1 ^i'r 

pis « '}{s s ''} - S s i-^ '-^' 

and 

' 31 J2 3 \ I 33 3i 3 



V^r.i2lJl J2 3\\31 32 3\ _ "J'J" (j.-,^-. 

^^'^ b3 J. fiU 3. 3"i - FF' ^""-''^ 






32 3a j'l (31 h 3'\ I h 31 f 



y^[/l2(_l)i+i2+is+m' IJ2 J4 J I Jl 33 3 \ I 32 34 3 .g_^4^ 

^ \j3 jl j J 1 777-1 "73 777,7 V"^2 m4 -m' I 
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When four spins are coupled there are even more couphng schemes available, depending on 
which two pairs of spins are coupled first. The unitary transformation relating those coupling 
schemes contains a Wigner-9j symbol: 

\jlJ2J3J4;{Jl3){J24)JM)= Y, [Jl2][J34][Jl3][J24] 

Jl2J'J,4, 

31 j2 Ji2 ^ 

J3 J4 J34>\jd2J3J4;{Jl2){J34)JM) . (B.15) 

,^13 J24 J ) 

There are many useful relations relating these symbols to one another and expressing orthog- 
onality, the only one that is used in this thesis is the reduction of three 6j-symbols to one 
9j: 



Jl J2 J3 
J4 J5 k 

LJ7 is h) i 



B-)=W {---}{- --}{---}. (B.10, 



B.2 Spherical tensor operators 

Spherical tensor operators Am are the generalization of the eigenstates |jm) to operators. 
They transform similarly under a rotation, and the action of the angular momemtum operators 
on them is: 

[J± , AU = y'{j±m + l){JTm)Al^, , and [J, , A^ = mA?^ . (B. 17) 

The direct product of two tensor operators can be expressed as a new tensor operator using 
Clebsch-Gordan operators: 

{A^' ® S^'1'4 = E ^hmxhm2\hm3)^^,Bt^^ • (B.18) 

A very useful theorem is the Wigner-Eckart theorem, which allows to express the matrix 
elements of a spherical tensor operator in a spherical basis, by extracting the dependence on 
the m values: 

(jimi|A^„|i2m2) = (-ir-™^(_^^^ i^ ^^^^ {n\\A^\32) . (B.19) 

The Hermitian adjoint of a spherical tensor operator is not a spherical tensor operator, we 
have to replace it by: 

Bi, = {-\y^-^iA'\^ ■ (B.20) 



Some examples of spherical tensor operators used throughout the thesis are: 

-i\m and hjm = (-l)^+'"a,-^ , (B.21) 



^jm 



which can be used to construct higher-order spherical tensor operators using Eq. ( B.18[ ), e.g. 
the two-particle creation operator: 



4^4 



J3 
ma 



Y^ {jimiJ2m2\J3m2)a\mA2m2 ' (^-22) 



mim2 
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or the particle-hole operator: 



% ^ aj2 



33 



J2 {-iy^^'"'-HjimiJ2m2\hm3)a'.^^^a 



'J2~in2 • 



(B.23) 



mim2 
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c 



Hermitian adjoint maps 



The Hermitian adjoint maps, as defined in Chapter |3j are essential in the formahsm of the 
semidefinite programming algorithms. With every symmetry that is included in the 2DM, 
the Hermitian adjoint maps have a different analytical expression and need to be adapted. 
The specific form of the adjoint maps does not contribute to a better understanding, but for 
one who is interested in implementing the symmetries they are of vital importance, which is 
why they are included in this appendix. 



C.l v2DM formalism 

For every symmetry the Hermitian adjoint maps are still defined by the relation: 

Tr C{r)A = Tr C\A)r . (C.l) 

The X and Q map are Hermitian, so no adjoint map needs to be calculated. In what follows 
the adjoint maps for the G,Ti,T2 and 7^' maps are listed for the different symmetries discussed 
in Chapter [4} 

C.1.1 Spin symmetry 

Since the matrices C{A) are two-particle matrices, the spin-coupled version of the Hermitian 
adjoint maps is defined as: 

^HA)!b;cd = -7r=xri^ E Y.^l<^-l^b\SM){la,lad\SM)£HAU^,^^,,^^d., ■ (C.2) 
^{ab){cd)^^^^2 2 2 2 

The Q' map: the first non-trivial Hermitian adjoint map is the Q\ the spin-coupled form 



can be derived by subsituting Eq. (3.2) in Eq. (C.2) and performing the necessary angular 
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momentum algebra, this leads to: 
1 



QHnXcd 






— [SacAbd + {-ifdadAbc + i-lfdbcAad + hdAac] (C.3) 

^ad;c6 + (~1) Ad;ca + (~1) "^ac^db ~^ Abc;da ■ 



lie 
2 J 2 2 -^ 

1 1 S' 



2 2 



In which the 6ar function for spin-coupled particle-hole matrices is defined as: 

Aac=lY.^S]'Y^A!,.^,,. (C.4) 



The 7^' map: the 7^' map is a Q-like map in A, as is seen from Eq. (3.4), and is therefore 
easily recoupled to: 



I 2Tr A 5 

{SacSbd + i-lrSadhc) T77TT 7T + A 



^/{ah){cd) 

1 1 



iV(iV-l) ' ^^"^""^ 

hdAac + {-ifSadAbc + {-ifSbcAad + ^acAbd • (C.5) 



The difficulty lies in the derivation of the spin-coupled two-particle matrix A from a spin- 



coupled three-particle matrix. This can be achieved by substituting the inverse of (4.48) 
into: 



-tS 



hfT=^ E Y.^\<ya\<yb\SM){\ad\<ye\SM) 



J{ah){de)^ ^'2-"'2 

V V / V / aaCb CTdCTe 



2 2 

/ > -^{aaa){hcTi,)(cac);{duj){eUf,){cac) ' 

C(Tc 

and performing some basic angular momentum algebra to obtain: 



A 



ab:de 



sr [^]^ "s:" aZ{s;s) 



[S\^ 



abc;dec ' 



The double bar function can then be obtained by: 

1 



Aad= ^Y.^si'Y.y^mm A',,.^,, 



(C.6) 



(C.7) 



(C.8) 



The T2 map: the spin-coupled T2 map is derived by subsituting Eq. (3.7) into (C.2). The 



T2 is quite similar to the Q' map, and it is therefore quite straightforward to obtain: 



1 



1 



'^^^^^-o-'^'' yp)M^-i 



SbdAac + {-ifdadAbc + {-lf6bcAad + SacAbd) + ^ffe-cd 



1 



^/{ab){cd) 4; 



Eisf\l I g, 



' 1 5 

2 2 



aS' _l r ^^s as' 



+ (~1) A fj^ + ^cbiad 



. (C.9) 
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The most difficult part is again to construct the relation between a spin-coupled two-particle- 
one-hole matrix and the partially traced matrices in the above equation. Using the same 
strategy as before we can derive the spin-coupled version of the bar function, which maps a 
two-particle-one- hole matrix on a two-particle matrix, as: 



^ab;cd /_^ rop Z^ a 



[Sf 



abc:dec 



(CIO) 



For the tilde function, which maps a two-particle-one-hole matrix on a particle-hole matrix, 



the inverse of Eq. (4.52) is substituted in: 



A? 



^ ^(-1)1— (L,l - a,\ZM){la,l - a.\ZM) 



O'iCTc ae(Tz 



>2 "2 "' "2 2 

/ , ^{aaa)ibat){cac);{aaa){eae){zaz) 



(C.ll) 



Performing two recouplings using Eq. (B.14) introduces two 6j-symbols, and one obtains: 

Z 



Q Q Q L22 ^^ J K 2 2 -^ n 



S{Sab'^Sde) 



^ ^n.h^, 



(C.12) 



The much easier double tilde can be derived by subsituting the inverse of Eq. (4.52) in: 

(C.13) 



Acz — 2_^ Z-^ ■^{aaa){bai,){cc7c)\(acJa)(b(Ji,){zac) ' 
ab O-afTl, 



which immediately leads to: 



^c. = 'E[^]'EEh)^ 



SiSab',Sab) 

abc\abz 



(C.14) 



S Sah a,b 



The T-^' map: the spin-coupled version of the 7^ map is found by substituting Eq. (3.12) 



into (C.2). This results into the spin-coupled regular 7^' derived in the last paragraph and 



some extra terms, which are easily recoupled leading to: 



-'\r A\S 



^2 {^)ab;cd —^2 {^T)ab;cd 



V2 1 

[S] I 7M) 



[i^^)abc;d + (~1) (.^^)abd;c 



+ rr-TT [(^'^)cda;b + (~1) i^^)cdb;a\ 



+ 
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C.1.2 Spin and angular momentum symmetry 

The spin and angular momentum coupled form of the Hermitian adjoint maps is derived in a 
similar way as in the previous Section, by substituting the correct expression in: 



c\A)^; 



s 

cd 



1 



,1 1 



\/P)M)<t 



XI yi {::cra-^crb\SMs){lamalbmb\LML) 



(Tj, niami, 



,1 1 



Yl Y. {i^^ci:cyd\SMs){lcmcldmd\LML) 



OcCTd mcTld 



''- \^){ama(Ta)(bmb(Tb);{cmc(Tc){dmi(Tci) ■ \\j.lb) 

The Q'^ map: the spin and angular momentum coupled form of the Q'^ map is derived by 



subsituting Eq. (3.2) in Eq. (C.15) and performing the necessary angular momentum algebra: 
1 



aUA\^ = ( ^ 



SacKlAn, + {-l)''6adKlMi. 

+ [-l)^8bc5uAH, + ^bdSljM'i. 



Em1l |lj[{ 



^« ^b ^ [ ( A^' ^ A^ 

ic Ld ^ 



bcida 



+ (-1 



xX 



h la L 
Ic 1(1 L' 



A^' + A^ 



bd:ca 



(C.17) 



The har function for a spin and angular momentum coupled particle-hole matrix is automat- 
icly diagonal in the single-particle angular momentum /: 



^1') 



riaric 



m 



712 2^ [^] Z-^^'^b-^'^b ■ 



(C.18) 



X 



The Ti map: the T[ map is a Q-like map of the barred three-particle matrix A, so in spin 
and angular momentum coupled form it becomes: 



-'\( A\X 



Ti i'^)ab]cd 



1 



\/P)M) 
1 



i^achd + (-1) Sadhc) 



X, . ^ 2Tryl ^ 



7P)p)iv-i 



N{N - 1) 



"d 



+ {-ifSadSl,lMtl + ^acSl^lJIll 



with the bar function defined as: 



A^ 



Z-^ [\"12 Z_^ abc;a 



[X? 



(C.19) 



(C.20) 



,y(X;X) 
"dec ' 

in which the variable Y is shorthand for the three-particle quantum numbers L'^S. The double 
bar is derived in the same way a IDM is derived out of a 2DM: 

^i':l = ^ E[^]' E ^A^bmA^b■,cb ■ (C.21) 
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The 7^^ map: the 72 map in spin and angular momentum coupled form is again similar to 
the Q' map: 

"^2 (^)afe;cd 



yp)M^-i 



'SbAjJtX + i-^)''^bcSuA'i± 



+ {-^)''6ad6l,lA'ntl + Sac6l,lJ^i^ 



+ ^ab;cd 






1 1 Q 

2 '-' 

S' 



2 2 



+ (-1) 



^« ^^ ^ U 4^' + A^ 

I I L' ' ^ ^rfa;bc ^ ^cb;ad 



X \h L L \ ( ~x' , TX' 



4 -I- A 

1 1 T' ( \ db:ac ~r ■'~^ca;bd 
Lc id ^ 



(C.22) 



The regular bar function, which maps a two-particle-one-hole matrix on a two-particle matrix, 
is in spin and angular momentum form: 



A^ 

^ab:cd 



E 

Y 



m 



2 abc;dec 



(C.23) 



The tilde function, which maps a two-particle-one-hole on a particle-hole matrix, is a bit more 
complicated: 

. /IK ? 5' 






^' ^ab^de 






abc:aez 



Finally, the spin and angular momentum coupled double tilde function is given by: 

1 1 






X{Xab]Xab) 

abc:abz 



(C.24) 



(C.25) 



-/t 



-/t 



The 72 map: the spin and angular momentum coupled version of the 72 map is again 
almost identical to the 72 1 with some extra terms that need to be recoupled, leading to: 



"^2 {^)ab;cd —%, i^T)ab;cd 



1 



V2 



[[ld]{A^)tbc;d + i-^rM^u.)tM; 



^/{ab) 



+ 



\iQ 



\X: 



(Ic) 









(C.26) 



C.1.3 Translational invariance 

For the one-dimensional Hubbard model, we first include translational invariance to the spin- 



coupled adjoint maps derived in Section C.1.1 The inclusion of this symmetry actually 
simplifies the equations. 
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The Q"^ map: the Q^ map becomes: 






— [SacSbd + i-lfSadSbc] [Aa + ^b] 



(C.27) 



lie 

2 J 2 2 -^ 

lie/ 

2 2'-' 



,S'K„ 



^S aS'K,^ 



ad;cb ^ ' bd;ca ^ ' ac;db bc;da 



where the bar function maps a particle-hole matrix on a single-particle matrix: 



(C.28) 



The Ti map: the 7^ map is a Q-like map of the barred three-particle matrix A, and reduces 
in translationally invariant form to: 



Ti iA)ab]cd 



1 



^ ^ab:cd ■ 



{SacSbd + i-l)^SadSbc) 



2Tt A 



1 



iV(iV-l) N-1 



Aa + At 



(C.29) 



The bar function has exactly the same form as in Eq. (|C.7|). The double bar is different in 

(C.30) 



that it is automatically diagonal in the single-particle momentum: 



1 



^'^ ~ 9 A^t*^] /_^{^^ )Akk';kk' ■ 



The T] map: the T] is also simplified in translationally invariant form: 



t/ A\SK 



ViA) 



"'""' ^{ab){cd)N-l 



{^achd + (-1) ^adhc 



Aa + Ai^ 



A 



SK 
ab;cd 



' Eisfll I i 



^J{ab){cd) 4f 



2 2 






>;ac 



^ ' ca\bd cb;ad 



(C.31) 



The bar and tilde function are again identical to those defined in Eq. (C.IO) and (C.ll), the 
double tilde becomes diagonal in the single-particle momentum k: 



1 



-^'-^Eisi^EE-**' 



Sab ab 



SK{Sab;Sab) 

abk 



-t 



(C.32) 



-'t 



The T2 map: the extra terms added to the regular 7^' map to form the 7^'' map become, 
in translationally invariant form: 

^2 iA)ab]cd =% iAT)ab;cd ~ TTT n-^w [iA^)abc;d + (~1) {A^)abd;c 



[S] V 7M) 



+ 



7P) 



[(^C.)L;6+(-l)^(A 



■'^)cdb:a\ 



+ 



ViMcd)N 



-y [5ac5bd + {-if^adhc) [{Ap)a + [Ap] 



b\ ■ 



(C.33) 
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C.1.4 Translational invariance with parity 



In this Section the parity-symmetric form of the Hermitian adjoint maps is derived. As all 
the adjoint maps are defin ed on t wo-particle space, the pa rity-sy mmetric form can be found 
in the same way as in Eq. (4.129) for < i^ < vr and Eq. (4.130) for K = or tt. 



The Q' map: the parity-symmetric version of the Q' map has the same form as in Eq. ( C.27 ) 



with some minor changes. The whole expression has to be multiplied by the appropriate norms 



'"-^ib ^-^cd^ ^^^ single-particle terms change as: 



A, 



A-k^ 



(C.34) 



with 



E^E 



4 eiV|^'' 



E-4: 



hh'-hh' 



(C.35) 



As was the case for the IDM, k lies in the range < A; < vr, whereas the momentum that is 
summed over, k' , runs over all possible momenta < A;' < 2it. There are two expressions for 
the four remaining particle-hole terms, for < i^ < vr we have to replace: 



ad\cb 



> A - - 

^7r' ad;cb 
ad cb 



(C.36) 



while for i^ = or vr the correct expression is: 



ad\cb 






adicb 



2 GN^j^d gjy^ad 

ad cb 



+ vr 



V A ''^ 



ad:cb 



2 GN^S-d GjSf^'^d 
ad cb 



(C.37) 



-t 



The Ti map: the 7^ map is a Q-like map of the barred three-particle matrix A, and as 



such it changes in the same way as Eq. (4.134) when including parity. The only difficulty lies 



in deriving a parity-symmetric two-particle matrix A from a parity-symmetric three-particle 
matrix A. There are two different expressions, when < i^ < vr: 



-^ab-Ae 



'N^b 'N^e 



E 



\S' 



712 



[S] 



.E 

c 



E.'^, 



S'{S;S)K''' 
abcidec 



ab(b)c de(b)c 



(C.38) 



and when i^ = or vr: 



A 



abide 



'N^b ^Ng 



2-^ \R]'i 



E 



S'{S;S)K"' 
abc\dec 



E.'A 



ab(b)c de{b)c 



+ vr- 



^t' abc;dec 

ab(S)c de(S)c 



(C.39) 
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The T] map: the parity-symmetric expression of the 72 map is also similar to the transla- 



tionally invariant form defined in Eq. (C.31). The difference is that we have to multiply the 
whole expression by the appropriate norm ^N^ ^^^^ replace the single-particle term A^ by 

the parity-symmetric Aj^, the two-particle term A^f^.^^ by A^^^.^^^ and the four particle-hole 
terms by: 



when < i^ < TT and by 



EA^ da y^ A,_ <i°- 

.^ -„„ , ^' da;hc ^-r^' da;bc (C A^^\ 

^d-a;b-c ^ 2 eiV> 5N^d-a +^2 SiV^^-^ gN"""- ' ^ ^ ^ 

da be ^ ^"da be 

when i^ = or vr. The parity-symmetric two-particle matrix A can be derived from a parity- 
symmetric two-particle-one-hole matrix A by performing: 

ro/l2 y- aS'{S;S)K'-' 

aSK^ _ r TuK r lyK \^ ['-' i \^ ^^' abc;dec (T 49^ 

^a6;de ^^ ab ^^ de Z^ r^p Z^ 2 '^^A^^'^; 7^2 iVf/^' ' ^^'^^^ 

S' c ab(S)c de{S)c 



when < -R' < vr or 

\Sk^ _ Tt^jK Tt^yKST^ [^' \~ Y^ I ^tt' ^abe;dee 



\cn2 I V aS'{S\S)k'-^ 



J5A''' _ r 7yri<: r j^tK V^ [^ J V^ ( ^^tt' ""afce;dec 

^ab;de ^^ ab ^^ de /^ r^i2 Z^ I 2 TiJSfS'K' T2^S'K' 



s-^ ,S'(S;S)K"' 

^"^ 2^2 MS' K''''t2nS'K' I ' (C-43) 



when K = IT or 0. The parity-symmetric particle- hole matrix A is derived by: 



0/ c , c , V 2 2 ^^ J L 2 2 



5/ 1 Q^fC' 1 Q 

o O I I O Q o 

I r, 

1 *-'de 



,S'{Sat;Sde)K'' 



EvMw F^'^gg';.... ■ (c-i^) 



2 T2]S[S'K' TijyS'K' 



when < -R' < TT and by 



o/cc L22 ^^JL2 2 rfej 

E V(afe)(«e) ^;^'^g^?^;-^^g,^, +7r^(ab)(ae) ^;^'^g^?^;-^^,^, I . (C.45) 
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when K = OT TT. Finally, the parity-symmetric form of the double tilde function which maps 
a two-particle-one- hole matrix on a single-particle matrix is: 



At. - „ Y.[Sf Y.Y. t^mSK^ ^ ^^bkl 

S Sab 0-b obk vr' 



(C.46) 



-/t 



■/t 



The T2 map: the parity-symmetric form of the T2 rnap is almost identical to the transla- 



tionally invariant expression in Eq. (C.33), the only thing that changes is the A^^ term, which 
has to be replaced by: 



y-^bj) ahc;d 



T.AA 



^Jabc;d 



^n}^ 



for < K < TT, and 



iA.f 



E.'(A 



<^Jabc\d 



abc\d 



T2NI 



Y.AA ^'^ 

ahc "" ' abc 



u 



abcu 



Tajv)'^ 



(C.47) 



(C.48) 



for i^ = or vr. Of course the whole expression has to be multiplied by the appropriate norms 



C.2 V2.5DM formalism 



For the v2.5DM formalsm, the Hermitian adjoint maps are defined through the relation: 

Tr C{W)A = Tr C\A)W , (C.49) 

in which A is a block matrix of the same dimension as C{W), and the traces sum over the 
appropriate indices. There is an additional complication compared to the Hermitian adjoint 
maps of the v2DM formalism, i.e. the matrix A does not necessarily have the right consistency 



symmetry (see Section 6.1.1). For this reason the Q2-iiiap, which is the equivalent of the Q- 
map for the 2DM, is not identical to its adjoint Q/2- The only maps which are Hermitian are 
the Xi and X2 maps. 



The Qg'J^^P- using Eq. (C.49) one can derive the form of the Qg niap: 



Qt / A\l\S{Sab',Scd) 
2\^) \ab;cd 

^SahSrA 



yp)M 



2Tr AI2.5 
N{N-l){N-2) 

\Sa 



V(a6)(cd) 



SbdAac + {-Ir^'^SadAbc + i-ir^' 6bcAad + Sac Am + SacA^ 



bd 



+ {-lf^''5adAbc + {-if^'hcAad + SbdAac + SacA + ^m) 

+{-lf^^6adiAl + Al) + i-lf^^6bciA:, + Ai,) + 6bd{A:, + A^ 



+ 2 i^acSbd + i-ir'^'SadSbc) [A'' + A 

-i- ^r, r, I A^^^ 4- i-^cd \ ( _'\'\Scd A^cd \ ( _'\\Sab A^ab _i_ A^cd \ 
+ '^SabScd \^ab;cd ^ ^ab;cd ^ \ ^) ^ba;cd ^ \ '-) ^dc;ab ^ ^cd;ab) 



+ 



^{ab){cd) L' 



SacAXf + {-^f'^'hcA'fS + (-l)'^"''5adilfr + hdA!'\l^c 



(C.50) 
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where we have introduced a lot of different partial traces of the matrix A. The first term of 



the right in Eq. (C.50) shows the inproduct of A with the unity matrix I2.5! which is not the 



regular trace of A, but rather defined by: 

Tr^i=Ef^E[^]' EE(i + ^-)^ 

I y S SabScd ah 



l\S{SabScd) 

ab]ab 



+2 Y, iSab\[Sc 



^ab^cd 



^ 9 Jab 



cd\\ 1 1 
2 2 



S, 



cd 






l\S(SabScd) 
lb 



There are four types of 'single-particle-like' matrices, given by: 



1 



^'=d-E[^]^EE(«^M' 



l\SiSab',Sab) 

ab-.ab ' 



^^ S Sr,b ab 



^- =ir- E Et^]' E E vMP)^ 



l\S{Sab',Sab) 

ab;cb ' 



IS Sab b 



l\S{Sab',Scd) 

Ibdd 



^""^P 5 SabS.d '-^ 2 ^cd] ^ 

where ngp = {N — 1)(A^ — 2), and three types of 'two-particle-like' contractions: 
js^b ^ 1 v^ \Sf sr^ j,us{Sab\S,d) 



(C.51) 

(C.52) 
(C.53) 
(C.54) 
(C.55) 



A 



abicd 



^ \^lQ]2Sr^\Sab\ j S 2 Sab\ .a\S{Sab;S^d) 



(C.56) 
(C.57) 



S Sab 

^^i'^=^E[^]' Ei^'^di^c.]!? ! fUf ! f3EA/(^^^^K^^'if&^'^''^ 

"-tp ^ c. Q I 2 2 "^afej L2 " 



Jah^^i 



ab'^cd 



2 *-^cd 



(C.58) 



with ntp = N — 2. 



The Q|-map: this maps a block-diagonal two-hole-one-particle matrix A, on a block- 
diagonal three-particle matrix, its explicit expression is given by: 



y(a6)(cd) 



2A 



N{N -1){N -2) 



^ab^c< 



7P)p) 



SacA-bd + {-if'^SadAc + (.-lf''''hcAad + hdA-ac + hd {K^ + A^^) 
+ (-l)^-<5ad (^L + Al) + (-l)^-5b, {Ai^ + Al,) + 5ac {At + A\ 
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•Jab^ci 



\/P)M 



1 a I Sab 



rb|5, 



r A0.\Oab _i_ (_-\\babX, Abl'^ab , (_-l\i>abX , A'^l^o.b , !! Ablba 



+ '^s..s., fif,^;',, + i-if^'At,, + (-i)''"^il"^.6 + ^fl 



ab 

^cd\ab 



^/^^ \Scd s' if^Ua 



(C.59) 



where many zero-, single-, and two-particle contractions are introduced. There is only one 
zero-particle contraction, given by: 



^= E [Sab][Scd]Y.{lb)A 

Sabred 



/| 2i^ab;Scd) 

Ibdb 



(C.60) 



lb 



There are three types of single-particle contractions of A, similar to those defined for the 
Q2-ioaap, but not the same: 






l\ 2^^ab]Scd) 

Id 



2n. 



'ab:ab 



sp 



ab 



^-=^ E E\/(«^MAc6 



(C.61) 
(C.62) 
(C.63) 



SabScd b 

and two two-particle contractions: 



A^cd 
^ab:cd 



[^ab\ .a\2^^<^>''^<:d) 
„ , ^ re 1 \ab:cd 



00 1 

^'l^'^ ~ E[^]' E [SabWScd] \ Scd t U E ^/iMad)A 



ntp 



Sabred 



l\S{Sab\Scd) 

ab;ad 



(C.64) 
(C.65) 



S' 



The Ql-map: this maps a block-diagonal one-hole-two-particle matrix A, on a block-diagonal 



three-particle matrix, using Eq. (C.49) with Eq. (6.47) one can derive the explicit form: 

^SabScd 



Q^i f A\S(Sab]Scd) 



'ab:cd 



, , ,,, „ SacAbd + {-ir'^'dadAbc + i-iy ''"dbcAad + SbdAa 

y/{ab){cd) L 
+5bd{Al, + ~Ala) + {-\f-'Kd{A\, + i^,) + ^-if-'bbMl, + A^a) 

^baAA\^ + A^db) + {^ac^hd + {-^f^'^ad^b:) (^ + i' 



+ 



A/(«b)M) 



\a\Sa 
\bc 



bacAXf + (-l)*"''(^ad^ir + (-l)''"''J6cAXd' + '^M^' 



•Sab 



\b\Sab 

\ad 



\b\Sab 

\ac 



-6bd (i'lfc"" + i'lfa"') - (-l)^"'''^ad (ilfr + i' 



4a I "Sab 

c6 



iblSa 



ib\Sat 



-i-lf^b6bc(ArS + A'^,^ 



A^ab _i_ A^ab _i_ ^-^ab _i_ 4'^ab 

^ad:cb "T ^c6;ad "•" ^bc;da "•" ^da;bc 



6arA A'^lff + A''^^-' 



\bd 



\db 
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f—']\^ab ( AJab _i_ 4 "Sat. _i_ AJab _i_ A 



y^{ab){cd) ^ ^-^ 



yZ yAS'?[Sab][Scd][SM][Sdl]\l ! ^ 

2 2 '-'<^b 



S' StiSdi 



•J 9 ^di I J ^-^ 9 '-'bl I I ^-^ '-'W 9 



1 1 cf^c'c 1 

2 2 '-'crfj I'-' '-'oii 2 



\^^ lad 



{Sbi\Sdi) 



ad:cb 



+ (-1)*"M' 



■Sab /|i|'S"('S6;;5d!) 



Ibd:ca 



+ {-ly'^'A 



Scd Ah^ {Sbi]Sdi) 



lac;db 



+ (-1)' 



Sab+Scd Al\S'iSbi;Sdi) 



'A 



There are three single-particle type contractions, 



lbc:da 



(C.66) 






l\ ■2(!^bi;Sdi) 



\bl:dl ' 



' "St! Sdl 






1 1 2(^l'l]Sdt) 



ac ' 



SbiSdt 



A' 



2n, 



sp 



E[^''^][^'^'](-i)''"^''"E^' 



Sbi+SdiSr /i/|i('^w;5dO 



|aa;cc ; 



5'faz Sdl 



(C.67) 
(C.68) 
(C.69) 



and three types of two-particle contractions: 



\l\Sbi 
^ \bd 

\Sd 



J_ V ["^J^ V A^l^^^"'^"^ 



ab;cd „ / ^ re 1 \ab;cd 
'Hp „ L^cdJ 

'-'ab 



Sdl Sr^ Aiih(^bi;Sdi) 

aa\cd ' 



[^ab\ Aa\2^Sab\Scd) 



(C.70) 
(C.71) 

(C.72) 



The Q^-Tociap: this maps a one-particle-two-hole matrix A on a three-particle matrix. We 



derive its explicit expression by substituting Eq. (6.48) into Eq. (C.49): 



(->] ( A\S{Sab;Scd) 

y2\^)ab:cd 



Ss„. 



x/P)P) 



6bd{Aac + Kc) + {-ir'"'6ad{Abc + Al) + {-l)^^''6bMad + K 



ad) 



+6ac{AM + i;j - AXb 



\~'-) ^ac-.db ~ \~^) ^b 



\ Sab A Sab ASab 

^ac\db y ^' bd:ca bc:da 



b|5, 



ab 

ac 



1 a I Sab 



-(-1)^-<5„,A"„, 



1 a I Sab 



{ — ^'\^abX, Af\^o.b_s: Aai^ab 
\ ij Obc^ lad ^ac^ \i,d 



1 "l ^ab A Sab 



5bdA 



l_Y\Sab^Sab 



bd\ca 



ASab ^Sab ASab 



ad\cb 



bc:da 



(_^\SabASab _ ( _^\Sab A 

V ^) ^db:ac \ ^) ^ 



Sab ASab 



ST^ ST^ rc/i2 



V(«^)M) S' S,,Sdi 



[Sr[Sab][Scd][Sbi][Sdi] 



s 



Sdl 



2 2 '-'afe 



4' 

S K Sbi 



2 2 



Scd j \S' Sdl 2 



S Shi 2 



X A 



\l\S'{Sti\Sdi) 
'adicb 



+ (-1)*"^' 



Sab AU^'(^>>l\Sdl) _|_ /_-j^\Scd^/|'S"(5'6i;5di) _|_ /_'^\Sab + Scdj^l\^'(^bi;Sdl) 



'bd\ca 



'bc:da 



(C.73) 
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There are two types of single-particle contractions: 

1 



A„ 



2n, 



sp 






I s 



^ac=^EE[^]'E(-i)'"^'i2fr^""^ 



(C.74) 
(C.75) 



sp 



/ 5 



Sbt 



There is a prime added to the breve contraction A in Eq. (C.75), because it is slightly dif- 



ferent than the breve contraction appearing in Eq. (C.67). There are also three two-particle 
contractions, given by: 



1 1 






S') 



I s 



Sbi Sdi 



Sbi 



> A 



1 1 

2 2 
1 1 or' 

^'i':=i^E[^]'E[^^'n5d(-i)""-^"'^'J ^ ^'■- ' 



l\S(Si,i;Sdi) 
\ab:cd 



ntp 



SbiSdt 



ell 
Jbl 



E-4' 



l\SiSbi;Sdi) 
ab;cb 



1 1 



2 2 J 

S' 



^f;c.=-^E[^]'E[^^^][^'^'](-i)'"U ^^^ O^'Sc^^'^ 



ntp 



SbiSdt 



Sbi 



1 1 

2 2 



(C.76) 

(C.77) 
(C.78) 
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